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ATGGGTTATA 
TTTAATTCTC 
AAGATGTCAC 
CAGCCTGGCT 
AACAGCGTGG 
CTGATGAAGG 
ATGGGTGGTT 
GGAGCAAGCG 
ATGTGCATCA 
GATGAGGTCA 
ATTCAAAAGC 
GGTTTACGCA 
GAAGATTTAA 
TAATGTGGAG 
AGGATTTACT 
TATTTAATTG 
TATAGCAGGG 
ACACCTTTGA 
GTATTAGGCG 
TCAGCAAGGG 
ATGAACGTGA 
CGAGAAACTT 
GAGTTGGGAG 
AGGACGGAAA 
AATGAGCTAA 
GGATTATGAA 
CAAGACTAAT 
CAATTTAATT 
GCCAACGGTC 
CCCATTTTAA 
GGAGAATATG 
GGCGTTTTCT 
TCTTTAACGC 
AGCGGAGAGA 
AGGTAGACGG 



CCAAAGCGAA 
TGTTTAAGTG 
TTCGTCACAG 
GGAATTTAGG 
GGGAATCCTC 
GTATAAAAGC 
CTCCAGATTA 
ATAGATTGGG 
TGACTCATGG 
TGGCAAGATA 
CACTCCCATA 
GGAGTGGGGA 
ATAAGACGTT 
CGCCCTTTAG 
AGATCGCTTG 
CCACGGTGCA 
TACACTCATT 
CCGTGTTCAT 
AATTCGGTTT 
GAGAAATTAA 
TATAACCCAT 
ATGCATGGTA 
GGGCGTTCTG 
GCCAATTAGA 
CAGCCTTACA 
CTAGCAGGAG 
TACCCCTGGT 
CTGGAGCAGA 
GAAAATACAG 
TGGTGATAGT 
CTGGGCCGCA 
CCTAATTACG 
GGTACGGGAT 
CGGTGGAATA 
TAA 



GTGTACGTTG 
TGTCAATGTT 
TTGGATATTC 
AAATACGTTT 
GTGTAACAAG 
ATTCGTATCC 
TACGATAAAT 
CGTTGGAGGA 
CTGTGGATGT 
TACAGCTATT 
AGTTGATGTT 
GAGATTCAAG 
CTATTATATT 
TATTGCCTAC 
TATCAAACAA 
TTATTATGGC 
TTGAACAGGA 
AACACATTTA 
GTTAGGCTTT 
AGTTTTTTGA 
ATGTTATGGG 
TGATCAAGAA 
CTACAGCAGA 
GATCAAGATA 
GGCAGGGGAG 
GCGTATTAAC 
CAATTAGGAA 
CTGGCGTTTT 
ATGGCTCAAC 
CTTGCGACGA 
A6ATTGGACG 
CCACAGGGGA 
GATGATATCC 
TACCTTACGT 



AAAAAAACTG 
TGTTCCAATG 
ACTCCTATGT 
GACGCTGTTG 
AGAGTTAATA 
CAGTGACATG 
GAAGATTATA 
AGACTTATAT 
ATGATATGGA 
TGGGAACAAT 
TGAGAGTGTC 
AAAATCATCA 
GTCAGAGAGT 
GATAGAAACA 
TGGAAGACTT 
TTCTGGCCAT 
AACACAACAA 
CAGCGCX3TGG 
GACAAAAGTA 
GTTTCTCATC 
ATAACX3GCCA 
TTTCATGACA 
GTCTAATTTG 
TACAGCTTTA 
GAATCGCTTG 
GCTAAAAGCG 
CCAATGCAGT 
CAATTACAGA 
ATGGCATTTT 
TGGAAGCTGT 
TCATTTAAAG 
AATTATTATA 
ATTTAACATT 
AAAAATGGCA 



TCTTGTTTGG 
ACATCAGCTG 
AGCTGACATG 
GAGATGATGA 
AAAACGATTG 
GCAAAATCAA 
TCAATCGGGT 
GTGATGTTAA 
ACATAACTAT 
TGTCGGAAAA 
AATGAGCCTA 
TGCTTACTTA 
CAGGAGGCAA 
GCCACGTCTC 
GGATGATCCT 
TTAGTGTCAA 
GATATTATAG 
TGTCCCAGTT 
CGGATGTGAT 
CATCATCTCA 
GCATTTAAAT 
TATTAAAAGC 
ATTCATGTGA 
CTTAAACGGA 
TTCTAGGAGA 
GACACCCTCA 
CATCACAGCA 
ATGTGGACGT 
GCGATCCCTA 
TTATGCAAAC 
AATTTGGCGA 
TCAGAAGCCT 
TCATTTTTGG 
ATTATGTTCA 
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invention provides a novel 
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corresponding BagCel amino 
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and host cells comprising a 
nucleic acid sequence encoding 
BagCel, recombinant BagCel 
proteins and methods for 
producing the same. 
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NOVEL BACH LUS BaaCel CELLULASE 



CROSS-REFERENCE TO RELATED APPLICATIONS 

(Oil This application claims priority to U.S. Provisional Application No. 60/467.255 filed 
5 April 30. 2003 (Attorney Docket No. GC798P). which is herein incorporated in its entirety by 
reference. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY 
SPONSORED RESEARCH AND DEVELOPMENT 

10 [02] Not applicat)le. 

FIELD OF THE INVENTION 

[031 This invention relates to a novel cellulase referred to herein as BagCel. Also 
described are nucleic acids encoding the cellulase. compositions comprising said cellulase. 
methods of identifying novel cellulases and methods of using said compositions. Preferably 

1 5 the cellulase(s) are isolated from Bacillus species, preferably B. agaradhaerens. The 

present invention further relates to the use of the novel cellulase in compositions recognized 
in the art as advantageously having cellulase added thereto, including, as an additive in a 
detergent composition, in the treatment of cellulose containing fabrics, in the treatment of 
pulp and paper and in the treatment of starch for the production of high fructose com-syrup 

20 or ethanol. 

BACKGROUND OF THE INVENTION 
1041 Cellulose and hemicellulose are the most abundant plant materials produced by 
photosynthesis. They can be degraded and used as an energy source by numerous 
microorganisms, including bacteria, yeast and fungi, that produce extracellular enzymes 

25 capable of hydrolysis of the polymeric substrates to monomeric sugars (Aro et al.. 2001 ). As 
the limits of non-renewable resources approach, the potential of cellulose to become a major 
renewable energy resource is enormous (Krishna ef a/.. 2001). The effective utilization of 
cellulose through biological processes is one approach to overcoming the shortage of foods, 
feeds, and fuels (Ohmiya etal., 1997). 

30 [051 Cellulases are enzymes that hydrolyze cellulose (beta-1 ,4-glucan or beta D- 

glucosidic linkages) resulting in the forination of glucose, cellobiose. cellooligosaccharides. 
and the like. Cellulases have been traditionally divided into three major classes: 
endoglucanases (EC 3.2.1.4) ("EG"), exoglucanases or cellobiohydrolases (EC 3.2.1.91) 
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("CBH") and beta-glucosidases ([beta] -D-glucx)side glucohydrolase; EC 3.2.1.21) ("BG"). 
(Knowles et ai, 1987; Shulein, 1988). Endoglucanases act mainly on the amorphous parts 
of the celiutose fibre, whereas cellobiohydrolases are also able to degrade crystalline 
cellulose (Nevalainen and Penttita, 1995). Thus, the presence of a cellobiohydrolase in a 
cellulase system is required for efficient solubilization of crystalline cellulose (Suumakki, et 
aL 2000). Beta-glucosidase acts to liberate D-glucose units from cellobtose, cello- 
oligosaccharides, and other glucosides (Freer, 1993). 

[061 In order to efficiently convert crystalline cellulose to glucose the complete cellulase 
system comprising components from each of the CBH, EG and BG classifications is 
required, with isolated components less effective in hydrolyzing crystalline cellulose (Filho et 
aL, 1996). A synergistic relationship has been observed between cellulase components 
from different classifications. In particular, the EG-type cellulases and CBH- type cellulases 
synergistically interact to more efficiently degrade cellulose. See, e.g.. Wood. 1985. 
[07] Although cellulase compositions have been previously described, there remains a 
need for new and improved cellulase compositions for use in household detergents, 
stonewashing compositions or laundry detergents, etc. Cellulases that exhibit improved 
performance are of particular interest. 

BRIEF SUMMARY OF THE INVENTION 

[08] It is an object of the present invention to provide a novel cellulase having beneficial 
properties for use in detergents, treating textiles, biomass conversion and pulp and paper 
manufacturing. 

[091 It is an object of the present invention to provide polypeptides having cellulolytic 
activity and polynucleotides encoding the polypeptides. The polypeptides may improve the 
degradation of cell wall material, e.g., cellulose and/or hemicellulose. The polypeptides may 
also improve the stability or activity of other enzymes involved in the degradation of plant cell 
wall material, e.g., biomass. 

[10] An object of the present invention is to provide a novel cellulase and derivatives 
thereof, methods of producing such cellulases, and compositions comprising such novel 
cellulases. The present invention further relates to the use of the novel cellulase and 
derivatives thereof in compositions recognized in the art as advantageously having cellulase 
added thereto, including, as an additive in a detergent composition, in the treatment of 
textiles such as cellulose-containing fabrics and fibers useful therefor, as an animal feed 
additive, in biomass conversion, in the treatment of pulp and paper and in the treatment of 
starch for the production of high fructose com-syrup or ethanol. 

[11] it is a further object of the present invention to provide for a method of producing a 
novel cellulase via heterologous expression from recombinant host cells. 
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|12I It is yet a further object of the present invention to provide a nucleic acid sequence 
encoding the inventive ceilulase. In one aspect, the nucleic acid and amino acid sequence 
facilitate commercial production of the novel ceilulase and ceilulase compositions of the 
invention. 

5 (131 It is still a further object of the present invention to provide a novel ceilulase having 
excellent properties for use in detergents, treating textiles, as a feed supplement and in pulp 
and paper manufacturing. In a further aspect, the ceilulase finds use in biomass conversion. 
(14J In a first aspect, the invention includes an isolated polynucleotide having a sequence 
which encodes BagCel. a sequence complementary to the BagCel gene coding sequence. 

10 and a composition comprising the polynucleotide. The polynucleotide may be mRNA. DNA. 
cDNA. genomic DNA. or an antisense analog thereof. 

[151 In one embodiment, a BagCel polynucleotide may comprise an isolated nucleic acid 
molecule which hybridizes to the complement of the nucleic acid presented as SEQ ID NO: 1 
under moderate to high stringency conditions, where the nucleic acid molecule encodes a 

15 BagCel polypeptide that exhibits cellulose binding activity. 

[161 The polynucleotide having at least 80%. 85%. 90%. 95%. 98% or more sequence 
identity to the sequence presented as SEQ ID NO:1 may encode a BagCel protein. In a 
specific embodiment, the polynucleotide comprises a sequence substantially identical to 
SEQ ID NO:1 . The invention also contemplates fragments of the polynucleotide, preferably 

20 at least about 15-30 nucleotides in length. 

[171 In a second aspect, a novel ceilulase or a derivative is provided which is obtainable 
from a Bacillus. Preferably, the ceilulase of the invention comprises an amino acid 
sequence according to Figure 2 (SEQ ID NO:2). a fragment, or a derivative thereof, having 
greater than 90% sequence identity, preferably greater than 95% sequence identity and 

25 more preferably greater than 97% sequence identity to an active portion thereto. 

[18] In a third aspect the present invention relates to a nucleic acid construct comprising 
the nucleotide sequence, which encodes for the polypeptide of the invention, operably linked 
to one or more control sequences that direct the production of the polypeptide in a suitable 
host. 

30 [19] The invention further provides recombinant expression vectors containing a nucleic 
acid sequence encoding BagCel or a fragment or splice variant thereof, operably linked to 
regulatory elements effective for expression of the protein in a selected host. In a related 
aspect, the invention includes a host cell containing the vector. 

[20] In a fourth aspect the present invention relates to a recombinant expression vector 
35 comprising the nucleic acid construct of the invention. 

[21] In a fifth aspect the present invention relates to a recombinant host cell comprising 
the nucleic acid construct of the invention. 
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[22] The invention further includes a method for producing BagCet by recombinant 
techniques, by culturing recombinant prokaryotic or eukaryotic host cells comprising nucleic 
acid sequence encoding BagCel under conditions effective to promote expression of the 
protein, and subsequent recovery of the protein from the host cell or the cell culture medium. 
5 [23] In a sixth aspect the present invention relates to a method for producing a 

polypeptide of the invention, the method comprising: (a) cultivating a microorganism, w^ich 
in its wild-type form is capable of producing the polypeptide, to produce the polypeptide; and 
(b) recovering the polypeptide. 

[24] In a seventh aspect the invention provides for an enzymatic composition useful in the 
10 conversion of cellulose to ethanol. In a preferred embodiment the enzymatic composition 
comprises BagCel. The composition may further comprise additional celtulase enzymes 
such as endoglucanases and/or cellbiohydrolases. The composition may be ennched in 
BagCet. 

[25] In one embodiment the invention provides a method of identifying novel enzymes by 
IS isolating total microbial community DNA from an environment, constructing a genomic DNA 

library in E.co//, screening the library for expression of cellulase activity, identifying the 

cellulase gene in the cellulase-positive clone and characterising the novel cellulase enzyme. 

[26] Further provided herein are analytical methods for detecting BagCel nucleic acids 

and BagCel proteins also form part of the invention. 
20 [27] According to yet another embodiment of the invention, a method of transforming a 

suitable microorganism with nucleic acid sequence encoding a cellulase according to the 

invention is provided. A method of producing the cellulase according to the invention from 

said transformed microorganism is provided. 

[28] A further object of the invention is to provide an expression vector particularty 
25 effective in Streptomyces. Streptomyces serve as alternate host cells for the production of 
various proteins and with respect to the expression and production of cellulases may offer a 
number of advantages over Bacillus host cells particularly when cells are grown at a high cell 
density. A preferred expression vector comprises a regulatory polynucleotide sequence 
including a promoter sequence derived from a glucose isomerase gene of Actinoplanes, a 
30 signal sequence derived from a Streptomyces cellulase gene, and a DNA sequence 
encoding a cellulase, particulariy a cellulase according to the invention. 
[29] In a preferred embodiment of the present invention, a full-length cellulase is 
obtainable from Bacillus. 

[30] Other objects, features and advantages of the present invention will become 
35 apparent from the following detailed description. It should be understood, however, that the 
detailed description and specific examples, while indicating preferred embodiments of the 
invention, are given by way of illustration only, since various changes and modifications 
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Within the scope and spirit of the invention will become apparent to one skilled in the art from 
this detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

(311 Figure 1 illustrates the environmental nucleotide sequence (SEQ ID NO:1). 
1321 Figure 2 illustrates a nucleic acid sequence encoding the novel cellulase (SEQ ID 
NO:2). 

133] Figure 3 illustrates the deduced amino acid sequence of the inventive cellulase (SEQ 
IDNO:3). 

DETAILED DESCRIPTION 

1341 The invention will now be described in detail by way of reference only using the 
following definitions and examples. All patents and publications, including all sequences 
disclosed within such patents and publications, referred to herein are expressly incorporated 
by reference. 

1 5 1351 Unless defined othenvise herein, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Singleton, et ai. Dictionary OF Microbiology and Molecular 
BIOLOGY. 2D ED.. John Wiley and Sons. New York (1994). and Hale & Marham. THE Harper 
Collins Dictionary of Biology. Harper Perennial. NY (1991) provide one of skill with a 

20 general dictionary of many of the terms used in this invention. Although any methods and 
materials similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, the preferred methods and materials are described. 
Numeric ranges are inclusive of the numbers defining the range. Unless othenwise 
indicated, nucleic acids are written left to right in 5' to 3' orientation: amino acid sequences 

25 are written left to right in amino to carboxy orientation, respectively. Practitioners are 

particulariy directed to Sambrook ef a/.. 1989. and Ausubel FM et al.. 1993. for definitions 
and terms of the art. It is to be understood that this invention is not limited to the particular 
methodology, protocols, and reagents described, as these may vary. 
[36] The headings provided herein are not limitations of the various aspects or 

30 embodiments of the invention which can be had by reference to the specification as a whole. 
Accordingly, the terms defined immediately below are more fully defined by reference to the 
specification as a whole. 

[371 All publications cited herein are expressly incorporated herein by reference for the 
purpose of describing and disclosing compositions and methodologies which might be used 
35 in connection with the invention. 



5 




wo 2004/099369 PCT/US2004/013175 
I. DEFINITIONS 

[38] "Cellulase,** "cellulolytic enzymes" or "cellulase enzymes" means the inventive 
bacterial endoglucanase described herein. Three different types of cellulase enzymes act 
synergistically to convert cellulose and its derivatives to glucose. 
5 [391 The term "cellulase" refers to a category of enzymes capable of hydrolyzing cellulose 
polymers to shorter cello-oiigosaccharide oligomers, cellobiose and/or glucose. Numerous 
examples of cellulases, such as exoglucanases, exocellobiohydrolases, endoglucanases. 
and glucosidases have been obtained from cellulolytic organisms, particulariy including 
fungi, and bacteria. The enzymes made by these microbes are mixtures of proteins with 

10 three types of actions useful in the conversion of cellulose to glucose: endoglucanases (EG), 
cellobiohydrolases (CBH). and beta-glucosidase. These three different types of cellulase 
enzymes act synergistically to convert cellulose and its derivatives to glucose. 
[40] Many microbes make enzymes that hydrolyze cellulose, including the wood rotting 
fungus Trichoderma, the compost bacteria Thermomonospora, Bacillus, and Cellulomonas] 

15 Streptomyces; and the fungi Humicola, Aspergillus and Fusarium, 

[41] By the term "host ceil" is meant a cell that contains a vector and supports the 
replication, and/or transcription or transcription and translation (expression) of the 
expression construct. Host cells for use in the present invention can be prokaryotic cells, 
such as E. coll, or eukaryotic cells such as yeast, plant, insect, amphibian, or mammalian 

20 cells. In a one embodiment according to the present invention, "host cell" means the cells of 
the genus Bacillus. In another preferred embodiment according to the invention, "host cell" 
means the cells of Streptomyces. A Streptomyces means any bacterial strain that is a 
member of the genus Streptomyces as classified in Buchanan et aL, The Shorter Bergey's 
Manual For Determinative Bacteriology (Williams & Wilkens 1982). Particulariy preferred 

25 strains of Streptomyces include S. lividens, S. rubiginosus, and S. coelicolor. S. livtdens is 
described in Lomovskaya et al., J. Virology 9:258 (1972). However, one of skill will realize 
that any appropriate host cell, e.g., bacterial, fungal, eukaryotic and plant cell may be used. 
[42] The term "recombinant" when used with reference, e.g., to a cell, or nucleic acid, 
protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been modified by 

30 the introduction of a heterologous nucleic acid or protein or the alteration of a native nucleic 
add or protein, or that the cell is derived from a cell so modified. Thus, for example, 
recombinant cells express genes that are not found within the native (non-recombinant) form 
of the cell or express native genes that are otherwise abnormally expressed, under 
expressed or not expressed at all. 

35 [43] The term "secretory signal sequence" denotes a DNA sequence that encodes a 
polypeptide (a "secretory peptide" or "secretory signal peptide") that, as a component of a 
larger polypeptide, directs the larger polypeptide through a secretory pathway of a ceil in 
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Which it is synthesized. The larger peptide is commonly cleaved to remove the secretory 
peptide during transit through the secretory pathway to yield the secretory signal peptide and 
a smaller peptide commonly refen-ed to as the mature polypeptide. 
[441 As used herein, the phrases "whole cellulase preparation" and "whole cellulase 

5 composition" are used interchangeably and refer to both naturally occurring and non- 

naturalty occurring compositions. A "naturally occurring" composition is one produced by a 
naturally occurring source and which comprises, for example, one or more 
cellobiohydrolase-type. one or more endoglucanase-type, and one or more p-glucosidase 
components wherein each of these components is found at the ratio produced by the source. 

1 0 Certain fungi produce complete cellulase systems which Include exo-cellobiohydrolases or 
CBH-type cellulases, endoglucanases or EG-type cellulases and beta-glucosidases or BG- 
type cellulases (Schulein. 1 988). However, sometimes these systems lack CBH-type 
cellulases and bacterial cellulases also typically include little or no CBH-type cellulases. A 
naturally occurring composition is one that is produced by an organism unmodified with 

1 5 respect to the cellulolytic enzymes such that the ratio of the component enzymes is 
unaltered from that produced by the native organism. 

|45] A "non-naturally occumng" composition encompasses those compositions produced 
by: (1 ) combining component cellulolytic enzymes either In a naturally occumng ratio or non- 
naturally occurring, i.e.. altered, ratio; or (2) modifying an organism to overexpress or 
20 underexpress one or more cellulolytic enzyme; or (3) modifying an organism such that at 
least one cellulolytic enzyme is deleted or (4) modifying an organism to express a 
heterologous component cellulolytic enzyme. 

[461 As used herein, the term "promoter" refers to a nucleic acid sequence that functions 
to direct transcription of a downstream gene. The promoter will generally be appropriate to 

25 the host cell in which the target gene is being expressed. The promoter together with other 
transcriptional and translational regulatory nucleic acid sequences (also termed "control 
sequences") are necessary to express a given gene. In general, the transcriptional and 
translational regulatory sequences include, but are not limited to. promoter sequences, 
ribosomal binding sites, transcriptional start and stop sequences, translational start and stop 

30 sequences, and enhancer or activator sequences. The promoter may be the promoter 

nomially associated with the downstream gene or it may be heterologous, i.e.. from another 
gene or another microorganism as long as it function to direct the gene. A preferred 
promoter when the transformation host cell is Bacillus is the ap/t promoter. In one aspect 
the promoter Is an inducible promoter. In one aspect, when the host cell is a filamentous 

35 fungus, the promoter is the T. reesei cbh1 promoter which is deposited in GenBank under 
Accession Number D86235. In another aspect the promoter is a cbh II or xylanase promoter 
from T. reesei. 



7 



wo 2004/099369 



PCT/US2004/013175 



[47] A nucleic acid is *'operably linked" when it is placed into a functional relationship with 
another nucleic acid sequence. For example, DNA encoding a secretory leader, i.e., a 
signal peptide, is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably 
5 linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are 
contiguous, and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking is accomplished by ligation at 
10 convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors 
or linkers are used in accordance with conventional practice. 

[48] "DNA construct" or "DNA vector" means a nucleotide sequence which comprises one 
or more DNA fragments encoding the novel cellulase. Included in "DNA vectors" are 
"expression vectors." Typical expression vectors contain regulatory sequences such as, 

IS transcription and translation terminators, transcription and translation initiation sequences, 
signal sequences, and promoters useful for regulation of the expression of the particular 
nucleic acid. The term "promoter" is used in its ordinary sense to refer to a polynucleotide 
sequence involved in the control of the initiation of transcription of a polynucleotide 
sequence encoding a protein. A "signal sequence" refers to a signal peptide or a portion of a 

20 protein that is capable of directing the transport of a desired protein in bioactive form from a 
host. The mature form of an extracellular protein lacks the signal sequence which is cleaved 
off during the secretion process. While not meant to limit the invention, the number of amino 
acid residues in a signal peptide may be between about 5 and about 100 amino acid 
residues. Signal sequence may be modified to provide for cloning sites that allow for the 

25 ligation of DNA or insertion of DNA encoding a cellulase. The vectors optionally comprise 
generic expression cassettes containing at least one independent terminator sequence, 
sequences permitting replication of the cassette in prokaryotes, eukaryotes, or both, (e.g., 
shuttle vectors) and selection markers for both prokaryotic and eukaryotic systems. Vectors 
are suitable for replication and integration in prokaryotes, eukaryotes, or both. See, Giliman 

30 and Smith, Gene 8:81-97 (1979); Roberts et aL, Nature 328:731-734 (1987); Berger and 
Kimmel, Guide To Molecular Cloning Techniques, Methods In Enzymology, Vol 152, 
Academic Press, Inc., San Diego, CA ("Berger"); Scheider, B., et aL, Protein Expr. Purif 
6435:10 (1995); Sambrook et aL MOLECULAR CLONING - A LABORATORY Manual (2nd ED.) 
Vol. 1-3, Cold Springs Harbor Publishing (1989) f'Sambrook"); and Current Protocols in 

35 Molecular Biology, Ausubel et d/.(eds.). Current Protocols, a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (1997 Supplement) 
("Ausubel"). Cloning vectors useful in Streptomyces are known and reference is made to 
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U.S. Patent Nps. 4.338,397; 4.411.994; 4,513.085; 4.513.086; 4,745.056; 5.514,590; and 
5.622.866 and WO88/07079. 

149] As used herein, the term "gene" means the segment of DNA involved in producing a 
polypeptide chain, that may or may not include regions preceding and following the coding 
5 region, e.g. 5' untranslated (5* UTR) or "leader" sequences and 3' UTR or "trailer" 

sequences, as well as intervening sequences (introns) between individual coding segments 
(exons). 

[501 The term "heterologous" when used with reference to portions of a nucleic acid 
indicates that the nucleic acid comprises two or more subsequences that are not normally 

10 found in the same relationship to each other in nature. For instance, the nucleic acid is 

typically recombinantly produced, having two or more sequences, e.g., from unrelated genes 
arranged to make a new functional nucleic acid, e.g.. a promoter from one source and a 
coding region from another source. Similarly, a heterologous protein will often refer to two or 
more subsequences that are not found in the same relationship to each other in nature (e.g., 

1 5 a fusion protein). 

[51] The terms "isolated" or "purified" as used herein refer to a nucleic acid or amino acid 
that is removed from at least one component with which it is naturally associated. 
[52] In the present context, the term "substantially pure polypeptide" means a polypeptide 
preparation which contains at the most 10% by weight of other polypeptide material with 

20 which it is natively associated (lower percentages of other polypeptide material are 

preferred, e.g. at the most 8% by weight, at the most 6% by weight, at the most 5% by 
weight, at the most 4% at the most 3% by weight, at the most 2% by weight, at the most 1 % 
by weight, and at the most 1/2% by weight). Thus, it is preferred that the substantially pure 
polypeptide is at least 92% pure, i.e. that the polypeptide constitutes at least 92% by weight 

25 of the total polypeptide material present in the preparation, and higher percentages are 
preferred such as at least 94% pure, at least 95% pure, at least 96% pure, at least 96% 
pure, at least 97% pure, at least 98% pure, at least 99%. and at the most 99.5% pure. The 
polypeptides disclosed herein are preferably in a substantially pure form. In particular, it is 
preferred that the polypeptides disclosed herein are in "essentially pure form", i.e. that the 

30 polypeptide preparation is essentially free of other polypeptide material with which it is 

natively associated. This can be accomplished, for example, by preparing the polypeptide 
by means of well-known recombinant methods. Herein, the term "substantially pure 
polypeptide" is synonymous with the terms "isolated polypeptide" and "polypeptide in 
isolated form". 

35 [53] In general, nucleic acid molecules which encode the BagCel will hybridize, under 
moderate to high stringency conditions to the sequence provided herein as SEQ ID NO:2 
(the BagCel). However, in some cases a BagCel-encoding nucleotide sequence is 
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employed that possesses a substantially different codon usage, while the protein encoded 
by the BagCel'-encoding nucleotide sequence has the same or substantially the same amino 
acid sequence as the native protein. For example, the coding sequence may be modified to 
facilitate faster expression of BagCel in a particular prokaryotic or eukaryotic expression 
5 system, in accordance with the frequency with which a particular codon is utilized by the 
host. Te'o, ef a/. (2000), for example, describes the optimization of genes for expression in 
filamentous fungi. 

[54] A nucleic acid sequence is considered to be "selectively hybridizable" to a reference 
nucleic acid sequence if the two sequences specifically hybridize to one another under 

10 moderate to high stringency hybridization and wash conditions. Hybridization conditions are 
based on the melting temperature (Tm) of the nucleic acid binding complex or probe. For 
example, "maximum stringency" typically occurs at about Tm-S'^C (S** below the Tm of the 
probe); "high stringency" at about 5-1 0** below the Tm; "moderate " or "intermediate 
stringency" at about 10-20"* below the Tm of the probe; and "low stringency" at about 20-25*' 

15 below the Tm. Functionally, maximum stringency conditions may be used to identify 

sequences having strict identity or near-strict identity with the hybridization probe; while high 
stringency conditions are used to identify sequences having about 80% or more sequence 
identity with the probe. 

[55] Moderate and high stringency hybridization conditions are well known in the art (see, 
20 for example, Sambrook. et al, 1 989. Chapters 9 and 1 1 , and in Ausubel, F.M., et aL, 1993, 
expressly incorporated by reference herein). An example of high stringency conditions 
includes hybridization at about 42^C in 50% formamide. 5X SSC, 5X Denhardt's solution, 
0.5% SDS and 100 ^g/ml denatured earner DNA followed by washing two times in 2X SSC 
and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at 



[56] As used herein, the terms "transformed", "stably transformed" or "transgenic" with 
reference to a cell means the cell has a non-native (heterologous) nucleic acid sequence 
integrated into its genome or as an episomal plasmid that is maintained through multiple 
generations. 

30 [571 As used herein, the term "expression" refers to the process by which a polypeptide is 
produced based on the nucleic acid sequence of a gene. The process includes both 
transcription and translation. 

[58] The term "introduced" in the context of inserting a nucleic acid sequence into a cell, 
means "transfection", or "transformation" or "transduction" and includes reference to the 
35 incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell where the 
nucleic acid sequence may be incorporated into the genome of the cell (for example, 
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42**C. 
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Chromosome, plasmid. plastid, or mitochondrial DNA). converted into an autonomous 
replicon, or transiently expressed (for example, transfected mRNA). 

[59J It follows that the term " BagCel expression" refers to transcription and translation of 
the BagCel cellulase gene, the products of which include precursor RNA, mRNA. 
polypeptide, post-translationally processed polypeptides. By way of example, assays for 
BagCel expression include Western blot for BagCel protein. Northern blot analysis and 
reverse transcriptase polymerase chain reaction (RT-PCR) assays for BagCel mRNA. and 
endoglucanase activity assays as described in Shoemaker SP. and Brown R.D.Jr. (Biochim. 
Biophys. Acta. 1978. 523:133-146) and Schulein (1988). 

[60] As used herein, the term "surfactant" refers to any compound generally recognized in 
the art as having surface active qualities. Thus, for example, surfactants comprise anionic, 
cationic and nonionic surfactants such as those commonly found in detergents. Anionic 
surfactants include linear or branched alkylbenzenesulfonates; alkyi or alkenyl ether sulfates 
having linear or branched alkyI groups or alkenyl groups; alkyI or alkenyl sulfates: 
olefinsulfonates; and alkanesulfonates. Ampholytic surfactants include quaternary 
ammonium salt sulfonates, and betaine-type ampholytic surfactants. Such ampholytic 
surfactants have both the positive and negative charged groups in the same molecule. 
Nonionic surfactants may comprise polyoxyalkylene ethers, as well as higher fatty add 
alkanolamides or alkylene oxide adduct thereof, fatty acid glycerine monoesters. and the 
like. 

|61) As used herein, the term "cellulose containing fabric" refers to any sewn or unsewn 
fabrics, yarns or fibers made of cotton or non-cotton containing cellulose or cotton or non- 
cotton containing cellulose blends including natural cellulosics and manmade celluloslcs 
(such as jute, flax, ramie, rayon, and lyocell). 

162] As used herein, the temi "cotton-containing fabric" refers to sewn or unsewn fabrics, 
yams or fibers made of pure cotton or cotton blends Including cotton woven fabrics, cotton 
knits, cotton denims, cotton yams, raw cotton and the like. 

[631 As used herein, the term "stonewashing composition" refers to a fomiulation for use 
in stonewashing cellulose containing fabrics. Stonewashing compositions are used to 
modify cellulose containing fabrics prior to sale. i.e.. during the manufacturing process. In 
contrast, detergent compositions are intended for the cleaning of soiled garments and are 
not used during the manufacturing process. 

[64] As used herein, the term "detergent composition" refers to a mixture which Is 
intended for use in a wash medium for the laundering of soiled cellulose containing fabrics. 
In the context of the present invention, such compositions may include, in addition to 
cellulases and surfactants, additional hydrolytic enzymes, builders, bleaching agents, bleach 
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activators, bluing agents and fluorescent dyes, caking inhibitors, masking agents, cellulase 
activators, antioxidants, and solubilizers. 

165] As used herein, the terms "active" and "biologically active" refer to a biological activity 
associated with a particular protein and are used interchangeably herein. For example, the 
5 enzymatic activity associated with a protease is proteolysis and, thus, an active protease has 
proteolytic activity. It follows that the biological activity of a given protein refers to any 
biological activity typically attributed to that protein by those of skill in the art. 
[66] When employed in enzymatic solutions, the BagCel component is generally added in 
an amount sufficient to allow the highest rate of release of soluble sugars from the biomass. 

10 The amount of BagCel component added depends upon the type of biomass to be 
saccharified which can be readily determined by the skilled artisan. However, when 
employed, the weight percent of the BagCel component relative to any other cellulase type 
components present in the cellulase composition is from preferably about 1, preferably about 
5, preferably about 10, preferably about 15, or preferably about 20 weight percent to 

15 preferably about 25, preferably about 30, preferably about 35, preferably about 40, 

preferably about 45 or preferably about 50 weight percent. Furthermore, preferred ranges 
may be about 0.5 to about 15 weight percent, about 0.5 to about 20 weight percent, from 
about 1 to about 10 weight percent, from about 1 to about 15 weight percent, from about 1 to 
about 20 weight percent, from about 1 to about 25 weight percent, from about 5 to about 20 

20 weight percent, from about 5 to about 25 weight percent, from about 5 to about 30 weight 
percent, from about 5 to about 35 weight percent, from about 5 to about 40 weight percent, 
from about 5 to about 45 weight percent, from about 5 to about 50 weight percent, from 
about 10 to about 20 weight percent, from about 10 to about 25 weight percent, from about 
10 to about 30 weight percent, from about 10 to about 35 weight percent, from about 10 to 

25 about 40 weight percent, from about 10 to about 45 weight percent, from about 10 to about 
50 weight percent, from about 15 to about 20 weight percent, from about 15 to about 25 
weight percent, from about 15 to about 30 weight percent, from at)out 15 to about 35 weight 
percent, from about 15 to about 30 weight percent, from about 15 to about 45 weight 
percent, from about 15 to atx>ut 50 weight percent. 

30 II. MOLECULAR BIOLOGY 

[67| This invention relies on routine techniques in the field of recombinant genetics. Basic ' 
texts disclosing the general methods of use in this invention Include Sambrook ef a/.. 
Molecular Cloning, A Laboratory Manual (2nd ed. 1 989); Kriegler. Gene Transfer and 
Expression: A Laboratory Manual (1990); and Ausubel et ai, eds.. Current Protocols in 

35 Molecular Biology ( 1 994)). 
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[68] To obtain high level expression of a cloned gene, the heterologous gene is preferably 
positioned about the same distance from the promoter as is in the naturally occurring 
cellulase gene. As is known in the art, however, some variation in this distance can be 
accommodated without loss of promoter function. 
5 1691 Those skilled in the art are aware that a natural promoter can be modified by 
replacement, substitution, addition or elimination of one or more nucleotides without 
changing its function. The practice of the invention encompasses and is not constrained by 
such alterations to the promoter. 

[70] The expression vector/construct typically contains a transcription unit or expression 
10 cassette that contains all the additional elements required for the expression of the 

heterologous sequence. A typical expression cassette thus contains a promoter operably 
linked to the heterologous nucleic acid sequence and signals required for efficient 
polyadenylation of the transcript, ribosome binding sites, and translation termination. 
Additional elements of the cassette may include enhancers and, if genomic DNA is used as 
15 the structural gene, introns with functional splice donor and acceptor sites. 

[71] The practice of the invention is not constrained by the choice of promoter in the 
genetic construct. The only constraint on the choice of promoter is that it is functional in the 
host cell used. A preferred promoter when the transformation host cell is Bacillus is the apr€ 
promoter. 

20 [72] In addition to a promoter sequence, the expression cassette should also contain a 
transcription termination region downstream of the structural gene to provide for efficient 
termination. The termination region may be obtained from the same gene as the promoter 
sequence or may be obtained from different genes. 

[73] The particular expression vector used to transport the genetic information into the cell 
25 is not particulariy critical. Any of the conventional vectors used for expression in eukaryotic 
or prokaryotic cells may be used. Standard bacterial expression vectors include 
bacteriophages A and Ml 3. as well as plasmids such as pBR322 based plasmids. pSKF, 
pET23D, and fusion expression systems such as MBP, GST, and LacZ. Epitope tags can 
also be added to recombinant proteins to provide convenient methods of isolation, e.g., c- 



[74] The elements that are typically included in expression vectors also include a replicon, 
a gene encoding antibiotic resistance to penmit selection of bacteria that harbor recombinant 
plasmids. and unique restriction sites in nonessential regions of the plasmid to allow 
insertion of heterologous sequences. The particular antibiotic resistance gene chosen is not 
35 critical, any of the many resistance genes known in the art are suitable. 

[75] The methods of transformation of the present invention may result in the stable 
integration of all or part of the transformation vector into the genome of the filamentous 
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fungus. However, transformation resulting in the maintenance of a self-replicating extra- 
chromosomal transformation vector is also contemplated. 

(76J The gene encoding the cellulase of the present Invention can be cloned using X- 
phage (expression) vectors and £. co// host cells. (Alternatively PGR cloning using 
5 consensus primers designed on conserved domains may be used.) Applicants have 

discovered that transformation of the gene encoding the cellulase of the present invention 
and expression in E. coli results in an active protein. After a first cloning step in E. coli, a 
cellulase gene according to the present invention can be transferred to a more preferred 
industrial expression host such as Bacillus or Streptomyces species, a filamentous fungus 

10 such as Aspergillus or Trichoderma, or a yeast such as Saccharomyces. High level 

expression and secretion obtainable in these host organisms allows accumulation of the 
cellulase in the fermentation medium from which it can subsequently be recovered. 
|77] A preferred general transformation and expression protocol for protease deleted 
Bacillus strains is provided in Ferrari et al., U.S. Patent No. 5.264.366, incorporated herein 

15 by reference. Transformation and expression in Aspergillus is described in, for example, 
Berka et al., U.S. Patent No. 5,364,770, incorporated herein by reference. 
[78] Many standard transfection methods can be used to produce Trichoderma reesei ceW 
lines that express large quantities of the heterologus protein. Some of the published 
methods for the introduction of DNA constructs into cetlulase-producing strains of 

20 Trichoderma include Lorito, Hayes. DiPietro and Herman, 1993, Curr. Genet. 24: 349-356; 
Goldman, VanMontagu and Herrera-Estrella, 1990, Curr. Genet. 17:169-174; Penttila, 
Nevalainen. Ratto, Salminen and Knowles, 1987, Gene 6: 155-164, for Aspergillus Yelton, 
Hamer and Timberlake, 1984, Proc. Natl. Acad. Sci. USA 81: 1470-1474, for Fusarium 
Bajar, Podila and Kolattukudy, 1991, Proc. NaO. Acad. Sci. USA 88: 8202-8212, for 

25 Streptomyces Hopwood et al., 1985, The John Innes Foundation, Norwich, UK and for 

Bacillus Brigidi, DeRossi, Bertarini, Riccardiand Matteuzzi, 1990. FEMS Microbiol. Lett. 55: 
135-138), all incorporated herein by reference. 

[791 However, any of the well-known procedures for introducing foreign nucleotide 
sequences into host cells may be used. These include the use of calcium phosphate 

30 transfection, polybrene, protoplast fusion, electroporation, biolistics, liposomes. 

microinjection, plasma vectors, viral vectors and any of the other well known methods for 
introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into 
a host cell (see, e.g., Sambrook et al., supra). Also of use is the Agrobacterium-mediated 
transfection method described in U.S. Patent No. 6,255,1 15. It is only necessary that the 

35 particular genetic engineering procedure used be capable of successfully introducing at least 
one gene into the host cell capable of expressing the heterologous gene. 

14 
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[80] After the expression vector is introduced into the cells, the transfected cells are 
cultured under conditions favoring expression of genes under control of cellulase gene 
promoter sequences. Large batches of transformed cells can be cultured as described 
below. Finally, product is recovered from the culture using standard techniques. 
1811 Thus, the invention herein provides for the expression and enhanced secretion of the 
inventive cellulases whose expression is under control of cellulase gene promoter 
sequences including naturally occurring cellulase genes, fusion DNA sequences, and 
various heterologous constructs. The invention also provides processes for expressing and 
secreting high levels of the inventive cellulases. 

III. Identification of Nucleic Acids and Encoded Protein Sequences 
1821 A genomic library from Bacillus agaradhaerans (DSM 8721 ) was prepared using 
standard techniques known in the art. This organism produces an alkaline cellulase. (endo- 
1.4-beta-glucanase). belonging to cellulase family 5 of glycosyl hydrolases, endoglucanase 
15 5A. EC 3.2.1.4. Swiss-Prot : 085465, entry name GUN5_BACAG. EBI accession number 
AF067428) the gene for which is 1203bp in length. (Davies et al. 1998). Cellulase positive 
clones were detected with an incidence of 1/3000 in the plate assay. In the process for 
isolating a gene according to an aspect of the present invention, degenerate primers based 
on the coding sequence for this enzyme were used. Unexpectedly, however, no PCR 
20 product was obtained using primers known to amplify the known B. agaradhaerans cellulase. 
The complete sequence of the insert coding for the cellulase was therefore detennlned by 
primer walking. 

[83] The process for isolating a gene according to the second aspect of the present 
invention makes use of its homology to a nucleotide sequence comprising all or part of the 
25 nucleotide sequence of SEQ ID NO: 2 as shown in the sequence listing. Examples of such 
processes include: 

a) screening a gene library which presumably contains a BagCel gene using the 
nucleotide sequence as a probe. 

b) preparing a primer based on the nucleotide sequence information, then 

30 performing PCR using a sample which presumably contains a BagCel gene 

as a template. 
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[84} More specifically, process a) above comprises: 

a) preparing a gene library which presumably contains a cellulase gene, 

screening the gene library using a nucleotide sequence comprising all or part 



[85] The gene library may be a genomic DNA library or a cDNA library, and may be 
prepared according to a known procedure. 

IV. PROTEIN EXPRESSION 
15 [86J Proteins of the present invention are produced by culturing cells transformed with an 
expression vector containing the inventive cellulase gene whose expression is under control 
of promoter sequences. The present invention is particularly useful for enhancing the 
intracellular and/or extracellular production of proteins. The protein may be homologous or 
heterologous. 

20 [87] Proteins of the present invention may also be modified in a way to form chimeric 
molecules comprising a protein of interest fused to another, heterologous polypeptide or 
amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of 
the protein of interest with a tag polypeptide which provides an epitope to which an anti-tag 
antibody can selectively bind. The epitope tag is generally placed at the amino-or carboxyl- 

25 terminus of the protein of interest. 

[88] Various tag polypeptides and their respective antibodies are well known in the art. 
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; HIS6 
and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 (Field et aL, 
Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7. 6E10. G4. B7 and 

30 9E1 0 antibodies thereto (Evan et aL, Molecular and Cellular Biology 5:361 0-361 6 (1 985)); 
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al., 
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the FLAG-peptide 
(Hopp etaL, BioTechnology ^^204-^2^0 (1988)); the KT3 epitope peptide (Martin etaL, 
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et aL, J. BioL Chem. 

35 266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth etaL, 
Proc. Natl, Acad. ScL USA 87:6393-6397 (1990)). 



5 



of the nucleotide sequence of SEQ ID NO:2 as shown in the sequence listing 
to select sequences which hybridize with the nucleotide sequence comprising 
all or part of the nucleotide sequence of SEQ ID NO:2 as shown in the 



sequence listing from the gene library, then isolating the selected sequences, 
and isolating a BagCel gene from the sequences which have been selected 
and isolated from the gene library. 
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189J Conditions appropriate for expression of said BagCel gene comprises providing to 
the culture the components necessary for growth and/or expression of the inventive 
cellulase. Optimal conditions for the production of the proteins will vary with the choice of 
the host cell, and with the choice of protein to be expressed. Such conditions will be easily 

5 ascertained by one skilled in the art through routine experimentation or optimization. 

[90] The protein of interest is typically purified or isolated after expression. The protein of 
interest may be isolated or purified in a variety of ways known to those skilled in the art 
depending on what other components are present in the sample. Standard purification 
methods include electrophoretic. molecular, immunological and chromatographic techniques, 

10 including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, 

and chromatofocusing. For example, the protein of interest may be purified using a standard 
anti-protein of interest antibody column. Ultrafiltration and diafiltration techniques, in 
conjunction with protein concentration, are also useful. For general guidance in suitable 
purification techniques, see Scopes, Protein Purification (1982). The degree of purification 

15 necessary will vary depending on the use of the protein of interest. In some instances no 
purification will be necessary. 

V. Utility of cellulase 

[911 Treatment of textiles according to the present invention contemplates textile 

20 processing or cleaning with a composition comprising the cellulase of this invention. Such 
treating includes, but is not limited to. stonewashing, modifying the texture, feel and/or 
appearance of cellulose-containing fabrics or other techniques used during manufacturing or 
cleaning/reconditioning of cellulose-containing fabrics. Additionally, treating within the 
context of this invention contemplates the removal of "immature" or "dead" cotton from 

25 cellulosic fabric or fibers. Immature cotton is significantly more amorphous than mature 
cotton and because of, for example, uneven dyeing. The composition contemplated in the 
present invention further includes a cellulase component for use in washing a soiled 
manufactured cellulose-containing fabric. For example, a cellulase of this invention may be 
used in a detergent composition for washing laundry. Detergent compositions useful in 

30 accordance with the present invention include special formulations such as pre-wash, pre- 
soak and home-use color restoration compositions. Such treating compositions, as 
described herein, may be in the form of a concentrate which requires dilution or in the form 
of a dilute solution or a form which can be applied directly to the cellulose-containing fabric. 
General treatment techniques for cellulase treatment of textiles are described in, for 

35 example, EP Publication No. 220 016 and GB Application Nos. 1,368,599 and 2,095.275. 
[921 Treatment of a cellulosic material according to the present invention further 
contemplates the treatment of animal feed, pulp and/or paper, food and grain for purposes 
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known in the art. For example, cellulases are known to increase the value of animal feed, 
Improve the drainability of wood pulp, enhance food products and reduce fiber in grain 
during the grain wet milling process or dry milling process. 

[93] Treating according to the instant invention comprises preparing an aqueous solution 
S which contains an effective amount of a cellulase or a combination of cellulases together 
with other optional ingredients including, for example, a buffer, a surfactant, and/or a 
scouring agent. An effective amount of a cellulase enzyme composition is a concentration of 
cellulase enzyme sufficient for its intended purpose. Thus, for example, an "effective 
amount" of cellulase in a stonewashing composition according to the present invention is 

1 0 that amount which will provide the desired effect, e.g., to produce a wom and faded look in 
seams and on fabric panels. Similariy, an "effective amount" of cellulase in a composition 
intended for improving the feel and/or appearance of a cellulose-containing fabric is the 
amount that produces measurable improvements in the feel, e.g., improving the smoothness 
of the fabric, or appearance, e.g., removing pills and fibrils which tend to reduce the 

IS sharpness in appearance of a fabric. The amount of cellulase employed is also dependent 
on the equipment employed, the process parameters employed (the temperature of the 
cellulase treatment solution, the exposure time to the cellulase solution, and the like), and 
the cellulase activity (e.g., a particular solution will require a lower concentration of cellulase 
where a more active cellulase composition is used as compared to a less active cellulase 

20 composition). The exact concentration of cellulase in the aqueous treatment solution to 

which the fabric to be treated is added can be readily determined by the skilled artisan based 
on the above factors as well as the desired result. In stonewashing processes, it has 
generally been preferred that the cellulase be present in the aqueous treating solution in a 
concentration of from about 0.5 to 5,000 ppm and most preferably about 10 to 200 ppm total 

25 protein. In compositions for the improvement of feel and/or appearance of a cellulose- 
containing fabric, it has generally been preferred that the cellulase be present in the aqueous 
treating solution in a concentration of from about 0.1 to 2000 ppm and most preferably about 
0.5 to 200 ppm total protein. 

[94] In a preferred treating embodiment, a buffer is employed in the treating composition 
30 such that the concentration of buffer is sufficient to maintain the pH of the solution within the 
range wherein the employed cellulase exhibits activity. The pH at which the cellulase 
exhibits activity depends on the nature of the cellulase employed. The exact concentration 
of buffer employed will depend on several factors which the skilled artisan can readily take 
into account. For example, in a preferred embodiment, the buffer as well as the buffer 
35 concentration are selected so as to maintain the pl-l of the final cellulase solution within the 
pH range required for optimal cellulase activity. The determination of the optimal pH range 
of the cellulases of the invention can be ascertained according to well-known techniques. 

18 
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Suitable buffers at pH wHhin the activity range of the cellulase are also well known to those 
skilled in the art in the field. 

1951 In addition to cellulase and a buffer, the treating composition may optionally contain a 
surfactant. Suitable surfactants include any surfactant compatible with the cellulase being 

5 utilized and the fabric including, for example, anionic, non-ionic and ampholytic surfactants. 
Suitable anionic surfactants include, but are not limited to, linear or branched 
alkylbenzenesulfonates; alkyi or alkenyl ether sulfates having linear or branched alkyi groups 
or alkenyl groups; alkyi or alkenyl sulfates; olefinsulfonates; alkanesulfonates and the like. 
Suitable counter ions for anionic surfactants include, but are not limited to, alkali metal ions 

10 such as sodium and potassium; alkaline earth metal Ions such as calcium and magnesium; 
ammonium ion; and alkanolamines having 1 to 3 alkanol groups of carbon number 2 or 3. 
Ampholytic surfactants include, e.g., quaternary ammonium salt sulfonates, and betaine-type 
ampholytic surfactants. Such ampholytic surfactants have both the positive and negative 
charged groups in the same molecule. Nonlonic surfactants generally comprise 

1 5 polyoxyalkylene ethers, as well as higher fatty acid alkanolamides or alkylene oxide adduct 
thereof, and fatty acid glycerine monoesters. Mixtures of surfactants can also be employed 
in manners known to those skilled in the art. 

[961 A concentrated cellulase composition can be prepared for use in the methods 
described herein. Such concentrates contain concentrated amounts of the cellulase 

20 composition described above, buffer and surfactant, preferably in an aqueous solution. 
When so formulated, the cellulase concentrate can readily be diluted with water so as to 
quickly and accurately prepare cellulase preparations having the requisite concentration of 
each constituent. When aqueous concentrates are formulated, these concentrates can be 
diluted so as to arrive at the requisite concentration of the components in the cellulase 

25 solution as indicated above. As is readily apparent, such cellulase concentrates permit facile 
formulation of the cellulase solutions as well as permit feasible transportation of the 
composition to the location where it will be used. The treating concenti-ate can be In any art- 
recognized form, for example, liquid, emulsion, gel, or paste. Such forms are well known to 
those skilled in the art. 

30 1971 When a solid cellulase concentrate is employed, the cellulase composition may be a 
granule, a powder, an agglomerate or a solid disk. The granules can be formulated so as to 
contain materials to reduce the rate of dissolution of the granules into the wash medium. 
Such materials and granules are disclosed in U.S. Patent No. 5.254.283 which is 
incorporated herein by reference in its entirety. 

35 198] Other materials can also be used with or placed in the cellulase composition of the 
present invention as desired, including stones, pumice, fillers, solvents, enzyme activators, 
and anti-redeposition agents depending on tiie eventual use of the composition. 
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[99] By way of example, stonewashtng methods will be described in detail, however, the 
parameters described are readily modified by the skilled artisan for other applications, Le., 
improving the feel and/or appearance of a fabric. The cellulose-containing fabric is 
contacted with the cellulase containing stonewashing composition containing an effective 
5 amount of the cellulase by intermingling the treating composition with the stonewashing 
composition, and thus bringing the cellulase enzyme into proximity with the fabric. 
Sujbsequently, the aqueous solution containing the cellulase and the fabric is agitated, if the 
treating composition is an aqueous solution, the fabric may be directly soaked in the 
solution. Similariy, where the stonewashing composition is a concentrate, the concentrate is 
10 diluted into a water bath with the cellulose-containing fabric. When the stonewashing 

composition is in a solid form, for example a pre-wash gel or solid stick, the stonewashing 
composition may be contacted by directly applying the composition to the fabric or to the 
wash liquor. 

[100] The cellulose-containing fabric is incubated with the stonewashing solution under 
1 5 conditions effective to allow the enzymatic action to confer a stonewashed appearance to 
the cellulose-containing fabric. For example, during stonewashing, the pH, liquor ratio, 
temperature and reaction time may be adjusted to optimize the conditions under which the 
stonewashing composition acts. "Effective conditions" necessarily refers to the pH, liquor 
ratio, and temperature which allow the cellulase enzyme to react efficiently with cellulose- 
20 containing fabric, in this case to produce the stonewashed effect. It is within the skill of 

those in the art to maximize conditions for using the stonewashing compositions according to 
the present invention. 

1101] The liquor ratios during stonewashing, Le,, the ratio of weight of stonewashing 
composition solution {i.e., the wash liquor) to the weight of fabric, employed herein is 
25 generally an amount sufficient to achieve the desired stonewashing effect in the denim fabric 
and is dependent upon the process used. Preferably, the liquor ratios are from about 4:1 to 
about 50:1 ; more preferably from about 5:1 to about 20:1 , and most preferably from about 
10:1 to about 15:1. 

1102] Reaction temperatures during stonewashing with the present stonewashing 
30 compositions are governed by two competing factors. Firstly, higher temperatures generally 
correspond to enhanced reaction kinetics, Le,, faster reactions, which permit reduced 
reaction times as compared to reaction times required at lower temperatures. Accordingly, 
reaction temperatures are generally at least about lO^^C and greater. Secondly, cellulase is 
a protein which loses activity beyond a given reaction temperature, which temperature is 
35 dependent on the nature of the cellulase used. Thus, if the reaction temperature is permitted 
to go too high, the celluiolyttc activity is lost as a result of the denaturing of the cellulase. 
While standard temperatures for cellulase usage in the art are generally in the range of 35'C 
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to 65°C, and these conditions would also be expected to be suitable for the cellulase of the 
invention, the optimal temperature conditions should be ascertained according to well known 
techniques with respect to the specific cellulase used. 

(103J Reaction times are dependent on the specific conditions under which the 
stonewashing occurs. For example. pH. temperature and concentration of cellulase will all 
affect the optimal reaction time. Generally, reaction times are from about 5 minutes to about 
5 hours, and preferably from about 10 minutes to about 3 hours and, more preferably, from 
about 20 minutes to about 1 hour. 

[1041 According to yet another preferred embodiment of the present invention, the cellulase 
of the invention may be employed in a detergent composition. The detergent compositions 
according to the present invention are useful as pre-wash compositions, pre-soak 
compositions, or for cleaning during the regular wash or rinse cycle. Preferably, the 
detergent composition of the present invention comprises an effective amount of cellulase. a 
surfactant, and optionally includes other ingredients described below, 
f 1051 An effective amount of cellulase employed in the detergent compositions of this 
invention is an amount sufficient to impart the desirable effects known to be produced by 
cellulase on cellulose-containing fabrics, for example, depilling. softening, anti-pilling. 
surface fiber removal, anti-graying and cleaning. Preferably, the cellulase in the detergent 
composition is employed in a concentration of from about 10 ppm to about 20.000 ppm of 
detergent. 

(1061 The concentration of cellulase enzyme employed in the detergent composition Is 
preferably selected so that upon dilution into a wash medium, the concentration of cellulase 
enzyme is in a range of about 0.01 to about 1000 ppm, preferably from about 0.02 ppm to 
about 500 ppm. and most preferably from about 0.5 ppm to about 250 ppm total protein. 
The amount of cellulase enzyme employed in the detergent composition will depend on the 
extent to which the detergent will be diluted upon addition to water so as to form a wash 
solution. ' 

(1071 The detergent compositions of the present invention may be in any art recognized 
form, for example, as a liquid, in granules, in emulsions, in gels, or In pastes. Such forms 
are well known to the skilled artisan. When a solid detergent composition is employed, the 
cellulase is preferably formulated as granules. Preferably, the granules can be formulated 
so as to additionally contain a cellulase protecting agent. The granule can be formulated so 
as to contain materials to reduce the rate of dissolution of the granule into the wash medium. 
Such materials and granules are disclosed in U.S. Patent No. 5.254,283 which is 
incorporated herein by reference in its entirety. 
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[108] The detergent compositions of this invention employ a surface active agent, /.e., 
surfactant, including anionic, non-ionic and ampholytic surfactants well known for their use in 
detergent compositions. 

[109] Suitable anionic surfactants for use in the detergent composition of this invention 
5 include linear or branched alkylbenzenesulfonates; alkyi or alkenyl ether sulfates having 
linear or branched alky! groups or alkenyl groups; alkyI or alkenyl sulfates; olefinsulfonates; 
and alkanesul-fonates. Suitable counter ions for anionic surfactants include alkali metal ions 
such as sodium and potassium; alkaline earth metal ions such as calcium and magnesium; 
ammonium ion; and alkanolamines having 1 to 3 alkanol groups of carbon number 2 or 3. 

10 Ampholytic surfactants include quatemary ammonium salt sulfonates, and betaine-type 
ampholytic surfactants. Such ampholytic surfactants have both the positive and negative 
charged groups in the same molecule, Nonlonic surfactants generally comprise polyoxyal- 
kylene ethers, as well as higher fatty acid alkanolamides or alkylene oxide adduct thereof, 
fatty acid glycerine monoesters, and the like. Suitable surfactants for use in this invention 

1 5 are disclosed in British Patent Application No. 2 094 826 A, the disclosure of which is 
incorporated herein by reference. Mixtures of such surfactants can also be used. The 
surfactant or a mixture of surfactants is generally employed in the detergent compositions of 
this invention in an amount from about 1 weight percent to about 95 weight percent of the 
total detergent composition and preferably from about 5 weight percent to about 45 weight 

20 percent of the total detergent composition. In addition to the cellulase composition and the 
surfactant(s), the detergent compositions of this invention can optionally contain one or more 
of the following components: 
Hydrolases Except Cellulase 

11101 Suitable hydrolases include carboxylate ester hydrolase, thioester hydrolase, 
25 phosphate monoester hydrolase, and phosphate diester hydrolase which act on the ester 

bond; glycoside hydrolase which acts on glycosyl compounds; an enzyme that hydrolyzes N- 
glycosyl compounds; thioether hydrolase which acts on the ether bond; and a-amijio-acyl- 
peptide hydrolase, peptidyl-amino acid hydrolase, acyl-amino acid hydrolase, dipeptide 
hydrolase, and peptidyl-peptide hydrolase which act on the peptide bond. Preferable among 
30 them are cart>oxylate ester hydrolase, glycoside hydrolase, and peptidyl-peptide hydrolase. 
Suitable hydrolases include (1) proteases belonging to peptidyl-peptide hydrolase such as 
pepsin, pepsin B, rennin, trypsin, chymotrypsin A, chymotrypsin B, elastase, enterokinase, 
cathepsin C, papain, chymopapain, ficin. thrombin, fibrinolysin, renin, subtilisin, 
aspergillopeptidase A, coilagenase, clostridiopeptidase B, kallikrein, gastrisin. cathepsin D., 
35 bromelin, keratinase, chymotrypsin C, pepsin C. aspergillopeptidase B, urokinase, 

carboxypeptidase A and B, and aminopeptidase; (2) glycoside hydrolases (cellulase which is 
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an essential ingredient is excluded from this group) a-amylase. ft-amylase. gluco amylase, 
invertase. lysozyme. pectinase. chitinase. and dextranase. Preferably among them are a- 
amylase and B-amylase. They function in acid to neutral systems, but one which is obtained 
from bacteria exhibits high activity in an alkaline system; (3) carboxylate ester hydrolase 
5 including carboxyl esterase, lipase, pectin esterase, and chlorophyllase. Especially effective 
among them is lipase. 

[1111 The hydrolase other than cellulase is incorporated into the detergent composition as 
much as required according to the purpose. It should preferably be incorporated in an 
amount of 0.001 to 5 weight percent, and more preferably 0.02 to 3 weight percent, in temis 

10 of purified protein. This enzyme should be used in the form of granules made of crude 
enzyme alone or in combination with other components in the detergent composition. 
Granules of crude enzyme are used in such an amount that the purified enzyme is 0.001 to 
50 weight percent in the granules. The granules are used in an amount of 0.002 to 20 and 
preferably 0.1 to 10 weight percent. As with cellulases. these granules can be formulated so 

15 as to contain an enzyme protecting agent and a dissolution retardant material. 
ratinnic Surfactants and I nnn-Chain Fatty Add Salts 

11121 Such cationic surfactants and long-chain fatty acid salts include saturated or 
unsaturated fatty acid salts. alkyI or alkenyl ether carboxyllc acid salts, a-sulfofatty acid salts 
or esters, amino acid-type surfactants, phosphate ester surfactants, quaternary ammonium 
20 salts including those having 3 to 4 alkyI substituents and up to 1 phenyl substituted alkyI 
substituents. Suitable cationic surfactants and long-chain fatty acid salts are disclosed in 
British Patent Application No. 2 094 826 A. the disclosure of which is incorporated herein by 
reference. The composition may contain from about 1 to about 20 weight percent of such 
cationic surfactants and long-chain fatty acid salts. 

25 Builders 

A Divalent sequesterin g agents 

(1131 The composition may contain from about 0 to about 50 weight percent of one or more 
builder components selected from the group consisting of alkali metal salts and alkanolamine 
salts of the following compounds: phosphates, phosphonates. phosphonocarboxylates. salts 

30 of amino acids, aminopolyacetates high molecular electrolytes, non-dissociating polymers, 
salts of dicarboxylic acids, and aluminosilicate salts. Suitable divalent sequestering gents 
are disclosed in British Patent Application No. 2 094 826 A. the disclosure of which is 
incorporated herein by reference. 
R Alkalis or inorganic electrolytes 

35 (1141 The composition may contain from about 1 to about 50 weight percent, preferably 
from about 5 to about 30 weight percent, based on the composition of one or more alkali 
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metal salts of the following cx)mpounds as the alkalis or inorganic electrolytes: silicates, 
carbonates and sulfates as well as organic alkalis such as triethanolamine, diethanolamine, 
monoethanolamine and triisopropanolamine. 
Antiredeposltion Agents 

[115] The composition may contain from about 0.1 to about 5 weight percent of one or 
more of the following compounds as antiredeposltion agents: polyethylene glycol, polyvinyl 
alcohol, polyvinylpyrrolidone and carboxymethylcellulose, 

[116J Among them, a combination of carboxymethyl-cellulose and/or polyethylene glycol 
with the cellulase composition of the present invention provides for an especially useful dirt 
removing composition. 
Bleaching Agents 

[117] The use of the cellulase of the present invention in combination with a bleaching 
agent such as potassium monopersulfate, sodium percarbonate, sodium perborate, sodium 
sulfate/hydrogen peroxide adduct and sodium chloride/hydrogen peroxide adduct or/and a 
photo-sensitive bleaching dye such as zinc or aluminum salt of sulfonated phthalocyanine 
further improves the detergenting effects. Similarly, bleaching agents and bleach catalysts 
as described in EP 684 304 may be used. 
Bluing Agents and Fluorescent Dves 

[118] Various bluing agents and fluorescent dyes may be incorporated in the composition, 
if necessary. Suitable bluing agents and fluorescent dyes are disclosed in British Patent 
Application No. 2 094 826 A, the disclosure of which is incorporated herein by reference. 
Caking Inhibitors 

[119] The following caking inhibitors may be incorporated in the powdery detergent: p- 
toluenesutfonic acid salts, xylenesulfonic acid salts, acetic acid salts, sulfosuccinic add salte, 
talc, finely pulverized silica, amorphous silicas, clay, calcium silicate (such as Micro-Cell of 
Johns Manville Co.), calcium carbonate and magnesium oxide. 
Antioxidants 

[120] The antioxidants include, for example, tert-butyl-hydroxytoluene, 4,4*-butylidenebis(6- 
tert-butyl-3-methylphenol), 2,2-butylidenebis(6-tert-butyl-4-methylphenol), monostyrenated 
cresol, distyrenated cresol, monostyrenated phenol, distyrenated phenol and 1,1-bis(4- 
hydroxy-phenyl)cyclohexane. 

Solubilizers 

(1211 The solubilizers include, for example, lower alcohols such as ethanol. 
benzenesulfonate salts, lower alkylbenzenesulfonate salts such as p-toluenesulfonate salts, 
glycols such as propylene glycol, acetylbenzene-sulfonate salts, acetamides. 
pyridinedicarboxylic acid amides, benzoate salts and urea. 
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[122] The detergent composition of the present invention can be used In a broad pH range 
from acidic to alkaline pH. In a preferred embodiment, the detergent composition of the 
present invention can be used in mildly acidic, neutral or alkaline detergent wash media 
having a pH of from above 5 to no more than about 12. 

5 [123] Aside from the above ingredients, perfumes, buffers, preservatives, dyes and the like 
can be used, if desired, with the detergent compositions of this invention. Such components 
are conventionally employed in amounts heretofore used in the art. 
[124] When a detergent base used in the present invention is in the fomn of a powder, it 
may be one which is prepared by any known preparation methods including a spray-drying 

10 method and a granulation method. The detergent base obtained particularly by the spray- 
drying method, agglomeration method, dry mixing method or non-tower route methods are 
preferred. The detergent base obtained by the spray-drying method is not restricted with 
respect to preparation conditions. The detergent base obtained by the spray-drying method 
is hollow granules which are obtained by spraying an aqueous slun^ of heat-resistant 

15 ingredients, such as surface active agents and builders, into a hot space. After the spray- 
drying, perfumes, enzymes, bleaching agents, inorganic alkaline builders may be added. 
With a highly dense, granular detergent base obtained such as by the spray-drying- 
granulation or agglomeration method, various ingredients may also be added after the 
preparation of the base. 

20 [1251 When the detergent base is a liquid, it may be either a homogeneous solution or a 

nonhomogeneous dispersion. For removing the decomposition of carboxymethylcellulose by 
the cellulase in the detergent, it is desirable that carboxymethylcellulose is granulated or 
coated before the incorporation in the composition. 

[126) The detergent compositions of this invention may be incubated with cellulose- 
25 containing fabric, for example soiled fabrics, in industrial and household uses at 
temperatures, reaction times and liquor ratios conventionally employed In these 
environments. 

[1271 Detergents according to the present invention may additionally be fomfiulated as a 
pre-wash in the appropriate solution at an intemiediate pH where sufficient activity exists to 

30 provide desired improvements softening, depilling. pilling prevention, surface fiber removal 
or cleaning. When the detergent composition is a pre-soak (e.g., pre-wash or pre-treatment) 
composition, either as a liquid, spray, gel or paste composition, the cellulase enzyme is 
generally employed from about 0.0001 to about 1 weight percent based on the total weight 
of the pre-soak or pre-treatment composition. In such compositions, a surfactant may 

35 optionally be employed and when employed, is generally present at a concentration of from 
about 0.005 to about 20 weight percent based on the total weight of the pre-soak. The 
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remainder of the composition comprises conventional components used in the pre-soak, /.e., 
diluent, buffers, other enzymes (proteases), and the lil^e at their conventional concentrations. 
[128] It is contemplated that compositions comprising cellulase enzymes described herein 
can be used in home use as a stand alone composition suitable for restoring color to faded 
5 fabrics (see, for example, U.S. Patent No. 4,738,682, which is incorporated herein by 
reference in its entirety) as well as used in a spot-remover and for depilling and antipilling 
(pilling prevention). 

[129] The use of the cellulase according to the invention may be particularly effective in 
feed additives and in the processing of pulp and paper. These additional industrial 
10 applications are described in, for example, PCT Publication No. 95/16360 and Finnish 
Granted Patent No. 87372, respectively. 

[130] In order to further illustrate the present invention and advantages thereof, the 
following specific examples are given with the understanding that they are being offered to 
illustrate the present invention and should not be construed in any way as limiting Its scope. 



[131] The following examples are offered to illustrate, but not to limit the claimed invention. 



[132] This example illustrates how to collect samples and process them to obtain sufficient 

20 DNA to create a cDNA library. 

(133J Samples of water (250 ml) were collected from the littoral zone of Sonachi (Crater) 
Lake. Kenya using a 250-ml stainless steel beaker mounted on the end of a flexible 
extendible 1-m pole and placed in sealable plastic containers (Whirlpak) for transport to the 
laboratory at ambient temperature. The temperature of the surface waters was 28''C, with pH 

25 10 and a conductivity of 7.23 mS cm*^ (at 27*C). 

[134] To collect the microbial flora, water (750 ml) from Sonachi (Crater) Lake. Kenya was 
filtered on site (using a hand operated vacuum pump) through a sequence of sterile 
membrane filters (47 mm diameter), composed of cellulose nitrate or cellulose acetate, of 
decreasing pore size, until all water flow stopped. The sequence of filters was S^m, 3^m and 

30 0.22^m. The individual membrane filters were placed immediately into 10 ml of cold, sterile 
cell stabilization buffer (TES) containing 10 mM Tris HCI. pH8.0; 1 mM EDTA and 5% w/v 
NaCI in 30 ml sterile plastic universal tubes and kept on ice in a refrigerated cool box until 
they could be processed further, usually within 4 hours of sampling. The microbial material 
on the filters was dispersed by vigorous vortex mixing with sterile glass beads (5 ml) and the 

35 cells pelleted in microfuge tubes by centrifugation at 13,000g for 5 min. The microbial 
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material was aliquoted to the microfuge tubes in volumes estimated to contain the equivalent 
of 10® to 10^ bacterial cells, giving a total of 12 tubes. The DNA was extracted using the 
GenomicPrepT'^ Cells and Tissue DNA isolation kit (Amersham Phannacia Biotech. 
Piscataway. NJ. USA) following the manufacturer's instructions. Cells in each tube were 
resuspended in 600 ^l of the Cell Lysis Solution provided, and incubated at SOX for 5 min to 
lyse the cells. Samples prepared by this method are stable at room temperature for at least 
18 months, and were transported back to the laboratory in this fomn. DNA extraction was 
completed by RNase A treatment, protein precipitation and isopropanol precipitation of the 
DNA following the manufacturer's protocol. Each DNA pellet was dissolved in 100^1 sterile 
Tris buffer 10mM pH 8.5. 

(135) DNA yield was estimated by running 5^l samples on a 0.5% w/v agarose gel and 
comparing with known amounts of bacterial genomic DNA. The samples were pooled, giving 
a total of about 20ng DNA. Since yields were low. the material was supplemented with about 
30% extra material extracted from the water samples which were collected at the same time 
as the on-site material and stored at 4**C in the laboratory until required. This amount of 
DNA. about SO^g. was the amount of starting material that preliminary experiments had 
shown was needed to carry out the trial and bulk restriction digestion and size f ractionation 
to give sufficient material for library construction. 

Example 2 
Library Construction 

(1361 The following example details how to prepare a DNA library for use in screening and 

detection of novel sequences in E. coli. 

Preparation of DNA 

[1371 The pooled DNA was used for construction of the genomic DNA library. The purified 
DNA was partially digested with Sau3A1 to give an average fragment size of about 5 kb. 
Restricted DNA was size fractionated by electrophoresis on 0.5% agarose in TAE (0.04M 
Tris-acetate. O.OOIiyi EDTA pH 8.0). Material in the 1.5 to 10 kb range was excised and 
replaced in a well of the same size cut in an unused part of the agarose gel and 
concentrated to a narrow band by reversed electrical current. The DNA band was excised 
and DNA extracted using the QIAGEN (Crawley. UK) QIAEXII gel extraction kit, following the 
manufacturer's guidelines. The eluted DNA was precipitated with ethanol and resuspended 
in 10 mM Tris HCI buffer. pH 8.5. 

Preparation of Lambda libraries 
(1381 The restricted DNA was cloned into a Lambda vector using the 2AP-Express™ 

vector kit (predigested with SamHI and alkaline phosphatase treated) and the Gigapak® III 

Gold packaging extract (Stratagene. Amsterdam, The Netherlands) following the 
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manufacturer's protocol. The primary libraries were amplified as per protocol by plating 
aliquots containing -5x10*pfu with host E. co// strain XL1-Blue MRP on 150mm Petri dishes 
and eluting the phage in buffer. Amplified libraries were stored in 7% v/v dimethyl sulphoxide 
at -80"C after freezing in liquid nitrogen. The total primary titre was 1.8 x 10* pfu and after 
amplification 6.8 x 10' pfu m\'\ 

Assessment of Library Quality 
1139] The phagemid vector pBK-CMV was excised from the Lambda ZAP library using 
ExAssist helper phage (Stratagene) as described by the manufacturer, and used to infect 
E.CO// strain XLOLR. Plasmid-containing clones were isolated by plating on Luria - BertanI 
(LB) agar containing 50 ng mr^ kanamycin. Blue:white screening in the presence of Xgal [5- 
bromo-4-chloro-3-indoyl-p-D-galactoside] and IPTG [isopropylthio-p-D-galactosideJ was 
used to determine cloning efficiency. If no DNA has been cloned into the Lambda vector, the 
3-galactosidase gene is expressed in the presence of the inducer IPTG, resulting in 
cleavage of the substrate analogue Xgal to produce a blue pigment in the colony. If however 
a fragment of the genomic DNA has been successfully cloned into the Lambda vector it 
disrupts the gene so that no enzyme Is produced and the colony remains white. The ratio of 
blue to white colonies therefore can be used to calculate the percentage of clones containing 
an insert. For this library the blue:white screen gave a ratio of 7 blue to 286 white colonies, 
indicating that 97% of the clones contained an insert of the genomic DNA. Twenty four 
colonies were selected at random and plasmid DNA prepared using the Wizard®Plus SV 
Miniprep DNA purification system (Promega UK, Southampton) Restriction analysis using 
Psti and H/hdIil which flank the BamHI cloning site followed by agarose gel electrophoresis 
was used to detennine Insert sizes. One clone out of the 24 was found to have no detectable 
insert. The rest had inserts ranging from 1 .5 kb to 8.0 kb. 

Example 3 
Library screening for cellulases 

U40J DNA libraries in the pBK-CMV phagemid were screened for cellulase activity in a 

plate assay of the E. co// clones. To detect cellulase activity the genomic libraries were 

plated on LB agar containing kanamycin. 0.5% w/v carboxymethylcellulose (low viscosity 

sodium salt; Sigma, Poole. UK) and IPTG (15 nl of a 0.5 M solution spread on the surface of 

the agar in a 7 cm diameter Petri dish). Following overnight growth at 37''C. the colonies 

were overiayed with 3ml molten 0.7% w/v agarose dissolved in water which had been cooled 

to 50°C. After this had set. the plates were flooded with 0.1% w/v Congo Red solution for 30 

minutes followed by 2 washes with 1 M NaCI. Positive clones exhibiting extracellular 

cellulase activity were surrounded by a yellow halo against a red background (R. Teather 

and P.J. Wood, Applied & Environmental Microbiology, 43: 770-780, 1982). 
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[141] The screening of 11 0.000 E.co// pBK-CMV clones yielded 4 zones of clearing 
indicating potential cellullase-producing colonies. Three of these were successfully 
recovered as cellulase-producing clones after homogenising the agar plug removed from the 
cleared zone, streaking out for single colonies and confirming the phenotype by the Congo 



[142] Plasmid DNA was isolated from the three cellulase positive clones, and the size of 
the inserts determined by restriction digestion as described above. All three had the same 

10 size (about 3.5kb) and the same size fragments after digestion as detemiined by gel 

electrophoresis. This indicated that all three isolates were identical, derived by amplification 
of a single clone. This was confirmed by the first round of sequencing of the plasmid DNA 
(using primer sites in the pBKCMV plasmid). This was carried out by the Protein and 
Nucleic Acid Chemistry Laboratory at Leicester University, using the Perkin Elmer 'BigDye' 

15 terminator chemistry and the model 377 ABI automated DNA sequencer. Complete 

coverage of the sequence was obtained by 'primer walking' from both the 5'and 3' ends of 
the insert. The sequence was edited using Applied Biosystems multisequence editor 
Seqed™ version 1.0.3. Sequence was assembled with programmes in the GCG Wisconsin 
Package, version 10.2-UNIX, available at the University of Leicester. This identified an 

20 insert of environmental DNA of 4205 nucleotide bases (Figure 1 ). 



[143] Possible Open Reading Frames (ORF) in the nucleotide sequence of the inserted 
environmental DNA of clone BagCel were identified using the ORF Find facility of the 
25 MapDraw program (DNASTAR, Brighton, MA, USA) or ORF Search from the Vector NTI 
Suite of programs (InforMax®, North Bethesda, MD, USA). 

[144] This identified an ORF composed of 1713 nucleotides corresponding to a protein of 
570 amino acids, starting at position 1532 of the insert sequence and ending at position 
3244. The sequence of this ORF was excised using EditSeq (DNASTAR) and examined by 

30 BLAST programs. 

[1451 The nucleotide sequence of this ORF is shown in Figure 2. 
[146] An examination of the nucleotide sequence using the BLASTn program, which 
compares a nucleotide query sequence against a non-redundant nucleotide sequence 
database, indicated identity with parts of the Bacillus halodurans genome. 

35 (1471 An examination of the nucleotide sequence using the BLASTx program, which 
compares the six-frame conceptual translation products of a nucleotide query sequence 
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(both strands) against a protein sequence database indicated significant similarity to a 
number of bacterial endocellulases. The highest alignment score revealed 66% identity (103 
amino acids) to a 154 amino acid region of an endo-p-1 ,4-glucanase (celluiase B) of the 
facultative alkaliphilic bacterium Bacillus halodurans strain C125, an enzyme comprising 574 
5 amino acids (protein id BAB04322, accession AP001509). This newly identified gene coded 
for a protein of 570 amino acids with 66% identity to the Bacillus halodurans celluiase. 
[148) The translated protein composed of 570 amino acids is shown in Figure 3. 

Example 6 
Enzyme Characterization 

10 Influence of salt 

[1491 Cells of E.co// pBK-CMV containing the BagCel gene is suspended in 5 ml buffer (20 

mMTRIS-HCI. pH8.0; 500 mM NaCI; 0.1 mM EDTA; 0.1% Triton X-100) and disrupted by 

sonication on ice. The sonicated extracts are examined by agar diffusion assay on 

carboxymethylcellulose (CMC) at different NaCI concentrations. Sonicated extracts (100 (iL) 

15 and 1 in 10 dilutions are placed in wells punched in CMC-agar plates containing varying 

amounts of NaCI. The plates are incubated at 37*C for 16 hours and the resulting clearing 

zones indicating cellulose hydrolysis measured in millimetres. The celluiase BagCel is active 

over the range 0 - 25% w/v NaCI, although the activity at 25% w/v NaCI is only about 50% of 

the activity at 0% NaCL 

20 

Influence ofoH 

[150] The influence of pH on celluiase activity is investigated using the pH-gradient plate 
method described by Grant & Tindall (Isolation of alkaliphilic bacteria, In: Microbial Growth 
and Survival in Extreme Environments, Academic Press. London. 1980, pp. 27-36). An agar 

25 medium containing CMC is poured to a depth of 1 cm in square Petri dishes and allowed to 
set. A uniform trough 1 cm wide is cut from one edge of the plate and agar containing 20% 
w/v Na2CO3.10H20 and 0.2 M NaOH (prepared by mixing equal volumes of sterile 0.4 M 
NaOH/40% w/v Na2CO3.10H2O and 4% w/v agar at 60'C) is poured into the trough. The 
plates are developed at 37**C overnight to allow a uniform gradient from pH 12 to pH 7 to 

30 form. To test the pH tolerance of the BagCel celluiase a narrow trough is cut through the 
(agar) gradient at right angles to the original trough and filled with 1 ml of sonicated cell 
extract. The plates are allowed to develop overnight at 37**C. The plates re treated with 
Congo Red for 30 minutes to visualize the zone of cellulose hydrolysis. The BagCel 
celluiase is active to about pH 1 1 .5. 

35 
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[151] It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of 
this application and scope of the appended claims. All publications, patents, and patent 
applications cited herein are hereby incorporated by reference in their entirety for all 
purposes. 
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What is claimed is: 

1 . An isolated polynucleotide selected from the group consisting of: 

(a) a nucleic acid sequence having at least 85% sequence identity to presented as 
5 SEQ ID NO:1 . or the complement thereof; 

(b) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an BagCel polypeptide having at least 85% sequence identity to 
the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(c) a nucleic acid sequence which encodes or is complementary to a sequence 

10 which encodes an BagCel polypeptide having at least 90% sequence identity to 

the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(d) a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an BagCel polypeptide having at least 95% sequence identity to 
the amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

15 (e) a nucleic acid sequence which encodes or is complementary to a sequence 

which encodes an BagCel polypeptide having the amino acid sequence 
presented in Figure 3 (SEQ ID NO:3); 

wherein said isolated polynucleotide encodes a polypeptide having the biological activity of a 
cellulase and wherein the identity is determined by the CLUSTAL-W program in MacVector 
20 version 6.5, operated with default parameters, including an open gap penalty of 10.0, an 
extended gap penalty of 0.1, and a BLOSUM 30 similarity matrix. 

2. An isolated polynucleotide selected from the group consisting of: 

(a) a nucleic acid sequence presented as SEQ ID NO:1. or the complement thereof; 
25 (b) a nucleic acid sequence that hybridizes, under high stringency conditions to the 

sequence presented as SEQ ID NO:1 . or the complement or a fragment thereof. 

(c) a nucleic acid sequence presented as SEQ ID NO:2, or the complement thereof; 
and 

(d) a nucleic acid sequence that hybridizes, under high stringency conditions to the 
30 sequence presented as SEQ ID NO:2, or the complement or a fragment thereof, 

wherein said isolated polynucleotide encodes a polypeptide having the biological activity of a 
cellulase and wherein hybridization is conducted at 42*'C in 50% formamide, 6X SSC. 5X 
Denhardt's solution. 0.5% SDS and 100 pg/ml denatured earner DHA followed by washing 
two times in 2X SSPE and 0.5% SDS at room temperature and two additional times in 0.1 
35 SSPE and 0.5% SDS at 42*C. 
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3. The isolated nucleotide of claim 1 wherein the nucleotide is selected from the group 
mRNA. DNA. cDNA. genomic DNA, and an antisense analog thereof. 

4. The isolated polynucleotide of Claim 3, wherein said polynucleotide is an RNA molecule. 

5 

5. The isolated polynucleotide of claim 1 encoding an enzyme having cellulase activity, 
wherein the enzyme is isolated from a Trichoderma source. 

6. The isolated polynucleotide of Claim 5, wherein the enzyme is isolated from Trichoderma 
10 reeseL 

7. An expression construct comprising a polynucleotide sequence encoding an amino acid 
sequence having cellulase activity and (i) having at least 85% sequence identity to the amino 
acid sequence presented in SEQ ID NO:3, or (ii) being capable of hybridizing to a probe 

15 designed to hybridize with the nucleotide sequence disclosed in Figure 2 under conditions of 
intermediate to high stringency, or (iii) being complementary to a nucleotide sequence 
having at least 85% sequence identity to a nucleotide sequence encoding the amino acid 
sequence presented in SEQ ID NO:3 wherein the identity is determined by the CLUSTAL-W 
program in MacVector version 6.5, operated with default parameters, including an open gap 

20 penalty of 10.0, an extended gap penalty of 0.1 , and a BLOSUM 30 similarity matrix.. 

\ 

8. A expression vector comprising the polynucleotide of Claim 1 . 

9. A expression vector comprising an isolated polynucleotide of Claim 1, operably linked to 
25 control sequences recognized by a host cell transformed with the vector. 

10. An expression vector according to Claim 9 comprising a regulatory polynucleotide 
sequence including a promoter sequence derived from a glucose isomerase gene of 
Actinoplanes, a signal sequence derived from a Streptomyces cellulase gene, and a 

30 polynucleotide sequence encoding a BagCel cellulase. 

11 . A vector comprising the expression construct of Claim 8. 

12. A host cell transformed with the vector of Claim 8. 

35 

13. The host cell of Claim 12, which is a prokaryotic cell. 

14. The host cell of Claim 12, which is a eukaryotic cell. 

40 15. A substantially purified BagCel polypeptide with the biological activity of a cellulase, 
comprising a sequence selected from the group consisting of: 

(a) an amino acid sequence having at least 85% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

33 
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(b) an amino acid sequence having at least 90% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

(c) an amino acid sequence having at least 95% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 

5 (d) an amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(e) a substantially purified biologically active fragment of the amino acid sequence 
presented as SEQ ID NO:3 
wherein the identity is determined by the CLUSTAL-W program in MacVector version 6.5. 
operated with default parameters, including an open gap penalty of 10.0, an extended gap 
10 penalty of 0.1, and a BLOSUM 30 similarity matrix. 

16. The substantially purified BagCel cellulase polypeptide or a derivative is provided which 
is obtainable from a Bacillus. 

15 1 7. A method of producing a cellulase comprising the steps of: 

(a) culturing the host cell according to claim 12 in a suitable culture medium under 
suitable conditions to produce the cellulase; 

(b) obtaining said produced cellulase. 

20 18. The method of Claim 17 wherein the host cell is a filamentous fungi or yeast cell. 

19. The method of Claim 17 wherein the host cell is a bacterium. 

20. The method of Claim 19 wherein the bacterium is a Streptomyces. 

21. A purified enzyme having cellulase activity prepared by the method of Claim 17. 

22. A recombinant host cell comprising a deletion or insertion or other alteration in the 
BagCel gene which inactivates the gene and prevents BagCel polypeptide production. 



25 
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23. An antisense oligonucleotide complementary to a messenger RNA that encodes an 
BagCel polypeptide having the sequence presented as SEQ ID NO:3, wherein upon 
exposure to a cellulase-producing host cell, said oligonucleotide decreases or inhibits the 
production of cellulase by said host cell. 

24. The antisense oligonucleotide of Claim 23. wherein the host cell is a filamentous fungi. 

25. A detergent composition, said composition comprising a polypeptide selected from the 
group consisting of: 
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(a) 



an amino acid sequence having at least 85% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 



(b) 



an amino acid sequence having at least 90% sequence identity to the amino 
acid sequence presented in Figure 3 (SEQ ID NO:3); 



5 



(c) 



an amino acid sequence having at least 95% sequence identity to the amino 



acid sequence presented in Figure 3 (SEQ ID NO:3); 

(d) an amino acid sequence presented in Figure 3 (SEQ ID NO:3); 

(e) a substantially purified biologically active fragment of the amino acid sequence 
presented as SEQ ID NO:3 

10 wherein the identity is determined by the CLUSTAL-W program in MacVector version 6.5, 
operated with default parameters, including an open gap penalty of 10.0, an extended gap 
penalty of 0.1 , and a BLOSUM 30 similarity matrix. 

26. A detergent composition comprising a surfactant and a cellulase according to Claim 15. 

15 

27. The detergent according to claim 25. wherein said detergent is a laundry detergent. 

28. The detergent according to claim 25, wherein said detergent is a dish detergent. 

20 29. A feed additive comprising a cellulase according to claim 15. 

30. A method of treating wood pulp comprising contacting said wood pulp with a cellulase 
according to claim 15. 

25 31 . A method of converting biomass to sugars comprising contacting said biomass with a 
cellulase according to claim 15. 

32. The method of Claim 31 further comprising the generation of high fructose corn-syrup 
30 33. A method of producing ethanol, said method comprising the steps of: 
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(a) 



(b) 
(c) 



contacting a biomass composition with an enzymatic composition comprising 
BagCel to yield a sugar solution; 

adding to the sugar solution a fermentative microorganism; and 

culturing the fermentative microorganism under conditions sufficient to produce 

ethanol, 



40 



33. A method of identifying novel enzymes comprising: 

(a) isolating total microbial community DNA from an environment; 

(b) constructing a genomic DNA library in E.co//; 

(c) screening the library for expression of cellulase activity; 
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(d) identifying the cellulase gene in the cellulase-positive clone; and 

(e) characterising the novel cellulase enzyme. 
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Figure 1A Nucleotide sequence of inserted environmental DNA 

TCTCATTTAG TTTGACTGAA ATACCTACTG TGCCACAAAG TAAAGTTAAA 50 

CTGACGAATG TGGAACGAAT CACTTAATGG TTCTAGCATA GATAACGAAA 100 

GATGAACACG TTCAAAGTTC GCCACTCTTT TGAAAGAGGG TGAACTTTTT 150 

TTGTGACAAG AAAGGGTGTT AAATGAAGAT CAAAGCTAAA CAAGATGAGT 200 

AACGTTTCTT TTCTTTTTTA TAGAGTGAGT TAGTATATGA GTCCCTTATA 25 0 

AATTTCTAGA CTGTTATTTT AAATAATTGA ATGACTCAGT CACCATTAAG 300 

TTTTCAACAC CCATAAGCGA CGTTTGAAGA TCTAATGATG CGAGAGGTTT 350 

TATCACTTTG GAGCGGAAGA TCACTGTAGG ACTCGTTTTA TATGGTGAAC 400 

TTGGTGTTAT TGTGTATTTA AAAGAAAGGG AAACGAAAAA AATGGTTAAA 450 

TTAGAT^GAG GCTATTACAG AGAGGAGAAC AAACAATGAA CGTAACACTT 500 

GAAGTGACAT ACTGCACGAC TAAAGGTATT CGAACAACCT TTCATTCAGA 550 

AGGTATGGAG GCCGAAAAAG CAATTACCAT CGCAGAAGAT TTTCAGCGGA 600 

CAGGACGGAT AAAACAGATC GTCTTTAGAG ATGAGCGTGA TAGTCCGTGG 650 

ACGTTAAAAG AACTTAAAAG ATTTTTAGAA GAGATTAAAA CGGAGCCGCA 700 

TCATCTCTCT GTGTATTTTG ATGGGGGATT TGATTTGGAG ACACAACGAT 750 

CTGGTCTTGG GTGTGATTTA TTATGAACAA AATGACACGT CTTATCGGGT 800 

GAGAAGAAAC GCTACCGTGG CGTCATTGAC ATCGAATAAC GAAGCAGAAT 850 

ATGCCGCTTT ACATTTAGGA CTTAAAGAAC TTGAAGGGAT CGGTGCGCAT 900 

CATCTACCTA TCACTATTTA CGGTGATTCT CAAGTTGTGA TCAATCAGTT 950 

AAAAGGAGAA TGGGCGTGTA TGGAGGAGGT GTTAAATAAA TGGGCTGACC 1000 

GTATTGATCA GCATTTAGCT AAATTAGGCA TGACCGCTAC TTATAAGTTA 1050 

ATCCCCCGTA AAGAAAACCG TGAAGCAGAT CAACTGGCTA CACAAGCGTT 1100 

AAACGGGCAA GAAATTATAA GTCAACGTGA TGTCAGTGAG CGTGGTGCAG 1150 

ATTAGTCTGC ACCCGCATAA AAGTTAACGT ATATAGAAGT GGATGGGGAT 1200 

TAAAGGAACG TCATTCACTC TAAGCAAGCG TTGCGACAGC AAAAAAGAAA 1250 

CATATAAGGT TTTTCTGAGC TACTATCTAT ACAAATAGCC AAGTGGCAGT 13 00 

TAAGCTCTTA CCTCATCAAG TTTTTGACTA CCAGTCTTCC ACTCCTACTT 1350 

TCACCTATAT AAATTGGTTC CTTTTTTGTT AATAATCACT AATTTTGACG 1400 

GTATTTTTTA ATAGAAATAT ATGCTAGATT ATAAACTAGT AACGATGTAG 14 50 

AAGGTGGTGA TTGACCATAT AAGAAGACTC TTTCAAACCT GGTAGTATCG 1500 

CATTAAAAAA TTTGAAAGGT GGAGAGGACA CATGGGTTAT ACCAAAGCGA 1550 

AGTGTACGTT GAAAAAAACT GTCTTGTTTG GTTTAATTCT CTGTTTAAGT 1600 

GTGTCAATGT TTGTTCCAAT GACATCAGCT GAAGATGTCA CTTCGTCACA 1650 

GTTGGATATT CACTCCTATG TAGCTGACAT GCAGCCTGGC TGGAATTTAG 1700 

GAAATACGTT TGACGCTGTT GGAGATGATG AAACAGCGTG GGGGAATCCT 1750 

CGTGTAACAA GAGAGTTAAT AAAAACGATT GCTGATGAAG GGTATAAAAG 1800 

CATTCGTATC CCAGTGACAT GGCAAAATCA AATGGGTGGT TCTCCAGATT 1850 

ATACGATAAA TGAAGATTAT ATCAATCGGG TGGAGCAAGC GATAGATTGG 1900 

GCGTTGGAGG AAGACTTATA TGTGATGTTA AATGTGCATC ATGACTCATG 1950 

GCTGTGGATG TATGATATGG AACATAACTA TGATGAGGTC ATGGCAAGAT 2000 

ATACAGCTAT TTGGGAACAA TTGTCGGAAA AATTCAAAAG CCACTCCCAT 2050 

AAGTTGATGT TTGAGAGTGT CAATGAGCCT AGGTTTACGC AGGAGTGGGG 2100 

AGAGATTCAA GAAAATCATC ATGCTTACTT AGAAGATTTA AATAAGACGT 2150 

TCTATTATAT TGTCAGAGAG TCAGGAGGCA ATAATGTGGA GCX3CCCTTTA 2200 

GTATTGCCTA CGATAGAAAC AGCCACGTCT CAGGATTTAC TAGATCGCTT 2250 

GTATCAAACA ATGGAAGACT TGGATGATCC TTATTTAATT GCCACGGTGC 2300 

ATTATTATGG CTTCTGGCCA TTTAGTGTCA ATATAGCAGG GTACACTCAT 2350 

TTTGAACAGG AAACACAACA AGATATTATA GACACCTTTG ACCGTGTTCA 2400 

TAACACATTT ACAGCGCGTG GTGTCCCAGT TGTATTAGGC GAATTCX3GTT 2450 
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TGTTAGGCTT TGACAAAAGT ACGGATGTGA TTCAGCAAGG GGAGAAATTA 2500 

AAGTTTTTTG AGTTTCTCAT CCATCATCTC AATGAACGTG ATATAACCCA 2550 

TATGTTATGG GATAACGGCC AGCATTTAAA TCGAGAAACT TATGCATGGT 2600 

^ ATGATCAAGA ATTTCATGAC ATATTAAAAG CGAGTTGGGA GGGGCGTTCT 2650 

GCTACAGCAG AGTCTAATTT GATTCATGTG AAGGACGGAA AGCCAATTAG 2700 

AGATCAAGAT ATACAGCTTT ACTTAAACGG AAATGAGCTA ACAGCCTTAC 2750 

AGGCAGGGGA GGAATCGCTT GTTCTAGGAG AGGATTATGA ACTAGCAGGA 2800 

GGCGTATTAA CGCTAAAAGC GGACACCCTC ACAAGACTAA TTACCCCTGG 2850 

TCAATTAGGA ACCAATGCAG TCATCACAGC ACAATTTAAT TCTGGAGCAG 2900 

ACTGGCGTTT TCAATTACAG AATGTGGACG TGCCAACGGT CGAAAATACA 2950 

GATGGCTCAA CATGGCATTT TGCGATCCCT ACCCATTTTA ATGGTGATAG 3000 

TCTTGCGACG ATGGAAGCTG TTTATGCAAA CGGAGAATAT GCTGGGCCGC 3050 

AAGATTGGAC GTCATTTAAA GAATTTGGCG AGGCGTTTTC TCCTAATTAC 3100 

GCCACAGGGG AAATTATTAT ATCAGAAGCC TTCTTTT^CG CGGTACGGGA 3150 

TGATGATATC CATTTAACAT TTCATTTTTG GAGCGGAGAG ACGGTGGAAT 3 200 

ATACCTTACG TAAAAATGGC AATTATGTTC AAGGTAGACG GTAACATGAT 3 250 

TTTAATTAAT AGATAAACCA GCCTACCTAT CGTTTTTGGA AGAAGGCAAA 33 00 

CGAATCTCAT CTTACCAACA CCGTGCTTTA GAACTTTAGA AGTGACGGTG 3350 

TTTTTTAAGA CATGAGGAGA AGCAATCCTC TATCAACAGT CACCAATTTT 3400 

TATTCAGGAG GTGTCAAGTT ATCTAACGTT CTATGAATGC ATATAGTCTC 3450 

TGACGAATAA ACATAGTTAA AAAGAAGTGA GCCTAGTCCC CGAGGGGAAG 3500 

GGGATAATGC CAACGTATTG GATTAAAGTA CCTTCTTGAT AAAAAGAAAG 3550 

GGTTTTCAAG AGATGGAAAT GGGCTCGTTT GTTATACTTT AATTACGCCT 3600 

TGGAACGTCA TTTTGGCGGT GCTATTTAGT AAGATGGCTG ACATCATAAA 3650 

AGAGGAGTGG GTTCGATGGC TTTAATTCAA TTAAGCTTTA AATCACGAGC 3700 

ATTAATGTTG CAAACCTCTG TCAATGTTTT ATTACCGGTG GGAATGAATG 3750 

CTGTAGATTT TACACCAAGT GATGATTTTT CTTATGTTAC TGACCCTTTT 3800 

CCTGTCCTAT ATCTTTTGCA TGGTGCAACT GATGATTATT CAGCATGGCT 3850 

ACGTCTGTCC TCTATTGAAC GATATGCTGA AGAAAAAAAA TTGGCGGTCG 3900 

TCATGCCAAA TGCTGATATG AGTGCGTATA CGGATATGGT ACATGGCCAT 3950 

CGTTACTGGA CGTATATTAG TAAGGTGCTG CCTGAGTTTA TGAGAGCAAC 4000 

TTTTCCTATT TCTCAGCACC GTGAAGAC7VC CTTTGCAGCT GGTCTGTCTA 4050 

TGGGAGGATA CGGGGCTTTT AAATTGGCGC TGCGGCAACC GGAACGCTTC 4100 

GCTGCAGCTG TGTCATTATC TGGTGCAGTT GATATGAGAG AAGCAAGTCA 4150 

ACCAGACTCC CTATTTGTAA ATGCCTTTGG TGAAGGGACG AAAATCGCAG 4200 

GGACA 4205 



r 
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Figure 2 ORF Nucleotide sequence of cellulase gene 

ATGGGTTATA CCAAAGCGAA GTGTACGTTG AAAAAAACTG TCTTGTTTGG 
TTTAATTCTC TGTTTAAGTG TGTCAATGTT TGTTCCAATG ACATCAGCTG'' 
AAGATGTCAC TTCGTCACAG TTGGATATTC ACTCCTATGT AGCTGACATG 
CAGCCTGGCT GGAATTTAGG AAATACGTTT GACGCTGTTG GAGATGATGA 
AACAGCGTGG GGGAATCCTC GTGTAACAAG AGAGTTAATA AAAACGATTG 
CTGATGAAGG GTATAAAAGC ATTCGTATCC CAGTGACATG GCAAAATCAA 
ATGGGTGGTT CTCCAGATTA TACGATAAAT GAAGATTATA TCAATCGGGT 
GGAGCAAGCG ATAGATTGGG CGTTGGAGGA AGACTTATAT GTGATGTTAA 
ATGTGCATCA TGACTCATGG CTGTGGATGT ATGATATGGA ACATAACTAT 
GATGAGGTCA TGGCAAGATA TACAGCTATT TGGGAACAAT TGTCGGAAAA 
ATTCAAAAGC CACTCCCATA AGTTGATGTT TGAGAGTGTC AATGAGCCTA 
GGTTTACGCA GGAGTGGGGA GAGATTCAAG AAAATCATCA TGCTTACTTA 
GAAGATTTAA ATAAGACGTT CTATTATATT GTCAGAGAGT CAGGAGGCAA 
TTU^TGTGGAG CGCCCTTTAG TATTGCCTAC GATAGAAACA GCCACGTCTC 
AGGATTTACT AGATCGCTTG TATCAAACAA TGGAAGACTT GGATGATCCT 
TATTTAATTG CCACGGTGCA TTATTATGGC TTCTGGCCAT TTAGTGTCAA 
TATAGCAGGG TACACTCATT TTGAACAGGA AACACAACAA GATATTATAG 
ACACCTTTGA CCGTGTTCAT AACACATTTA CAGCGCGTGG TGTCCCAGTT 
GTATTAGGCG AATTCGGTTT GTTAGGCTTT GACAAAAGTA CGGATGTGAT 
TCAGCAAGGG GAGAAATTAA AGTTTTTTGA GTTTCTCATC CATCATCTCA 
ATGAACGTGA TATAACCCAT ATGTTATGGG ATAACGGCCA GCATTTAAAT 
CGAGAAACTT ATGCATGGTA TGATCAAGAA TTTCATGACA TATTAAAAGC 
GAGTTGGGAG GGGCGTTCTG CTACAGCAGA GTCTAATTTG ATTCATGTGA 
AGGACGGAAA GCCAATTAGA GATCAAGATA TACAGCTTTA CTTAAACGGA 
AATGAGCTAA CAGCCTTACA GGCAGGGGAG GAATCGCTTG TTCTAGGAGA 
GGATTATGAA CTAGCAGGAG GCGTATTAAC GCTAAAAGCG GACACCCTCA 
CAAGACTAAT TACCCCTGGT CT^TTAGGAA CCAATGCAGT CATCACAGCA 
CAATTTAATT CTGGAGCAGA CTGGCGTTTT CAATTACAGA ATGTGGACGT 
GCCAACGGTC GAAAATACAG ATGGCTCAAC ATGGCATTTT GCGATCCCTA 
CCCATTTTAA TGGTGATAGT CTTGCGACGA TGGAAGCTGT TTATGCAAAC 
GGAGAATATG CTGGGCCGCA AGATTGGACG TCATTTAAAG AATTTGGCGA 
GGCGTTTTCT CCTAATTACG CCACAGGGGA AATTATTATA TCAGAAGCCT 
TCTTTAACGC GGTACGGGAT GATGATATCC ATTTAACATT TCATTTTTGG 
AGCGGAGAGA CGGTGGAATA TACCTTACGT AAAAATGGCA ATTATGTTCA 
AGGTAGACGG TAA 
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Figure 3 Amino acid sequence of BagCel cellulase 



MGYTKAKCTL KKTVLFGLIL CLSVSMFVPM TSAEDVTSSQ LDIHSYVADM 50 

• QPGWNLGNTF DAVGDDETAW GNPRVTRELI KTIADEGYKS IRIPVTWQNQ 100 

MGGSPDYTIN EDYINRVEQA IDWALEEDLY VMLNVHHDSW LWMYDMEHNY 150 

DEVMARYTAI WEQLSEKFKS HSHKLMFESV NEPRFTQEWG EIQENHHAYL 200 

T EDLNKTFYYI VRESGGNNVE RPLVLPTIET ATSQDLLDRL YQTMEDLDDP 250 

YLIATVHYYG FWPFSVNIAG YTHFEQETQQ DIIDTFDRVH NTFTARGVPV 300- 

VLGEFGLIiGF DKSTDVIQQG EKLKFFEFLI HHLNERDITH MLWDNGQHLN 350 

RETYAWYDQE FHDILKASWE GRSATAESNL IHVKDGKPIR DQDIQLYLNG 40 0 

NELTALQAGE ESLVLGEDYE LAGGVLTLKA DTLTRLITPG QLGTNAVITA 450 

QPNSGADWRF QLQNVDVPTV ENTDGSTWHF AIPTHFNGDS LATMEAVYAN 500 

GEYAGPQDWT SFKEFGEAFS PNYATGEIII SEAFFNAVRD DDIHLTFHFW 550 

SGETVEYTLR KNGNYVQGRR 570 



WO 2004/099369 



4/4 



JC17Rec'dPCT/PT0 20 SEP 2005 



THIS PAGE BlMK 



(USPTO) 



